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FROM THE AUTHOR’S PREFACE TO 
THE FIRST GERMAN EDITION 


HE importance of the standpoint afforded by the theory 

of groups for the discovery of the general laws of 

quantum theory has of late become more and more 
apparent. Since I have for some years been deeply concerned 
with the theory of the representation of continuous groups, it 
has seemed to me appropriate and important to give an account 
of the knowledge won by mathematicians working in this field 
in a form suitable to the requirements of quantum physics. An 
additional impetus is to be found in the fact that, from the 
purely mathematical standpoint, it is no longer justifiable to 
draw such sharp distinctions between finite and continuous 
groups in discussing the theory of their representations as has 
been done in the existing texts on the subject. My desire to 
show how the concepts arising in the theory of groups find their 
application in physics by discussing certain of the more important 
examples has necessitated the inclusion of a short account of the 
foundations of quantum physics, for at the time the manuscript 
was written there existed no treatment of the subject to which 
I could refer the reader. In brief this book, if it fulfills its 
purpose, should enable the reader to learn the essentials of the 
theory of groups and of quantum mechanics as well as the rela- 
tionships existing between these two subjects ; the mathematical 
portions have been written with the physicist in mind, and vice 
versa. I have particularly emphasized the ‘reciprocity ” be- 
tween the representations of the symmetric permutation group 
and those of the complete linear group; this reciprocity has as 
yet been unduly neglected in the physical literature, in spite of 
the fact that it follows most naturally from the conceptual. 
Structure of quantum mechanics. 
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There exists, in my opinion, a plainly discernible parallelism 
between the more recent developments of mathematics and 
physics. Occidental mathematics has in past centuries broken 
away from the Greek view and followed a course which seems 
to have originated in India and which has been transmitted, 
with additions, to us by the Arabs ; in it the concept of number 
appears as logically prior to the concepts of geometry. The 
result of this has been that we have applied this systematically 
developed number concept to all branches, irrespective of whether 
it 1s most appropriate for these particular applications. But 
the present trend in mathematics is clearly in the direction of a 
return to the Greek standpoint ; we now look upon each branch 
of mathematics as determining its own characteristic domain 
of quantities. The algebraist of the present day considers the 
continuum of real or complex numbers as merely one “‘ field ”’ 
among many; the recent axiomatic foundation of projective 
geometry may be considered as the geometric counterpart of 
this view. This newer mathematics, including the modern 
theory of groups and “ abstract algebra,”’ is clearly motivated 
by a spirit different from that of ‘ classical mathematics,’ which 
found its highest expression in the theory of functions of a 
complex variable. The continuum of real numbers has retained 
its ancient prerogative in physics for the expression of physical 
measurements, but it can justly be maintained that the essence 
of the new Heisenberg-Schrédinger-Dirac quantum mechanics is 
to be found in the fact that there is associated with each physical 
system a set of quantities, constituting a non-commutative 
algebra in the technical mathematical sense, the elements of 
which are the physical quantities themselves. 


ZURICH, August, 1928 


AUTHOR’S PREFACE TO 
THE SECOND GERMAN EDITION 


URING the academic year 1928-29 I held a professorship 

in mathematical physics in Princeton University. The 

lectures which I gave there and in other American insti- 
tutions afforded me a much desired opportunity to present anew, 
and from an improved pedagogical standpoint, the connection 
between groups and quanta. The experience thus obtained has 
found its expression in this new edition, in which the subject 
has been treated from a more thoroughly elementary standpoint. 
Transcendental methods, which are in group theory based on 
the calculus of group characteristics, have the advantage of 
offering a rapid view of the subject as a whole, but true under- 
standing of the relationships is to be obtained only by following 
an explicit elementary development. I may mention in this 
connection the derivation of the Clebsch-Gordan series, which is 
of fundamental importance for the whole of spectroscopy and 
for the applications of quantum theory to chemistry, the section 
on the ¥ordan-Holder theorem and its analogues, and above all 
the careful investigation of the connection between the algebra 
of symmetric transformations and the symmetric permutation 
group. The reciprocity laws expressing this connection, which 
were proved by transcendental methods in the first edition, as well 
as the group-theoretic problem arising from the existence of spin 
have also been treated from the elementary standpoint. Indeed, 
the whole of Chapter V—which was, in the opinion of many 
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impossible to avoid presenting the principal part of the theory 
of representations twice; first in Chapter III, where the repre- 
sentations are taken as given and their properties examined, 
and again in Chapter V, where the method of constructing the 
representations of a given group and of deducing their properties 
is developed. But I believe the reader will find this two-fold 
treatment an advantage rather than a hindrance. 

To come to the changes in the more physical portions, in 
Chapter IV the réle of the group of virtual rotations of space 
is more clearly presented. But above all several sections have 
been added which deal with the energy-momentum theorem of 
quantum physics and with the quantization of the wave equation 
in accordance with the recent work of Hetsenberg and Pault. 
This extension already leads so far away from the fundamental 
purpose of the book that I felt forced to omit the formulation 
of the quantum laws in accordance with the general theory of 
relativity, as developed by V. Fock and myself, in spite of its 
desirability for the deduction of the energy-momentum tensor. 
The fundamental problem of the proton and the electron has 
been discussed in its relation to the symmetry properties of the 
quantum laws with respect to the interchange of right and left, 
past and future, and positive and negative electricity. At 
present no solution of the problem seems in sight; I fear that 
the clouds hanging over this part of the subject will roll together 
to form a new crisis in quantum physics. I have intentionally 
presented the more difficult portions of these problems of spin 
and second quantization in considerable detail, as they have 
been for the most part either entirely ignored or but hastily 
indicated in the large number of texts which have now appeared 
on quantum mechanics. 

It has been rumoured that the ‘‘ group pest’”’ is gradually 
being cut out of quantum physics. This is certainly not true 
in so far as the rotation and Lorentz groups are concerned ; 
as for the permutation group, it does indeed seem possible to 
avoid it with the aid of the Pauli exclusion principle. Never- 
theless the theory must retain the representations of the per- 
mutation group as a natural tool in obtaining an understanding 
of the relationships due to the introduction of spin, so long as 
its specific dynamic effect is-neglected. I have here followed the 
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trend of the times, as far as justifiable, in presenting the group- 
theoretic portions in as elementary a form as possible. The 
calculations of perturbation theory are widely separated from 
these general considerations; I have therefore restricted myself 
to indicating the method of attack without either going into 
details or mentioning the many applications which have been 
based on the ingenious papers of Hartree, Slater, Dirac and 
others. 

The constants ¢ and h, the velocitv of light and the quantum 
of action, have caused some trouble. The insight into the 
significance of these constants, obtained by the theory of rela- 
tivity on the one hand and quantum theory on the other, 1s 
most forcibly expressed by the fact that they do not occur in 
the laws of Nature in a thoroughly systematic development of 
these theorics. But physicists prefer to retain the usual c.g.s. 
units—principally because they are of the order of magnitude of 
the physical quantities with which we deal in everyday life. 
Only a wavering compromise is possible between these practical 
considerations and the ideal of the systematic theorist; I 
initially adopt, with some regret, the current physical usage, 
but in the course of Chapter IV the theorist gains the upper 
hand, 

An attempt has been made to increase the clarity of the 
exposition by numbering the formule in accordance with the 
sections to which they belong, by emphasizing the more im- 
portant concepts by the use of boldface type on introducing 
them, and by lists of operational symbols and of letters having 
a fixed significance. 


H. WEYL. 


GOTTINGEN, November, 1930 


TRANSLATOR’S PREFACE 


HIS translation was first planned, and in part completed, 
: during the academic year 1928-29, when the translator 
was acting as assistant to Professor Weyl in Princeton. 
Unforeseen delays prevented the completion of the manuscript 
at that time, and as Professor Weyl decided shortly afterward 
‘to undertake the revision outlined in the preface above it seemed 
desirable to follow the revised edition. In the preparation of 
this manuscript the German has been followed as closely as 
possible, in the conviction that any alterations would but de- 
tract from the elegant and logical treatment which characterizes 
Professor Weyl’s works. While an attempt has been made 
to follow the more usual English terminology in general, this 
programme is limited by the fact that the fusion of branches of 
knowledge which have in the past been so widely separated as 
the theory of groups and quantum theory can be accomplished 
only by adapting the existing terminology of each to that of 
the other; a minor difficulty of a similar nature is to be found 
in the fact that the development of “ fields ’’ and “ algebras ”’ 
in Chapter V is accomplished in a manner which makes it appear 
desirable to deviate from the accepted English terminology. 
It is a pleasure to express my indebtedness to Professor Weyl 
for general encouragement and assistance, to Professor R. E. 
Winger of Union College for the assistance he has rendered in 
correcting proof and in preparing the index, and to the publishers 
for their codperation in adhering as closely as possible to the 
original typography. 
H. P. ROBERTSON 


PRINCETON, September, 1931 
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INTRODUCTION 


Hike quantum theory of atomic processes was proposed by 
NieELts Bour in the year 1913, and was based on the 
atomic model proposed earlier by RUTHERFORD. The 
deduction of the Balmer serics for the line spectrum of hydrogen 
and of the Rydberg numbe~ from universal atomic constants 
constituted its first convincing confirmation. This theory gave 
us the key to the understanding of the regularities observed in 
optical and X-ray spectra, and led to a deeper insight into the 
structure of the periodic system of chemical elements. The issue 
of Naturwissenschaften, dedicated to Bour and entitled ‘ Die 
ersten zchn Jahre der Theorie von Niets Bohr tiber den Bau 
der Atome”’ (Vol. 11, p. 535 (1923)), gives a short account of the 
successes of the theory at its peak. But about this time it began 
to become more and more apparent that the Bonr theory was 
a compromise between the old ‘‘classical’’ physics and a new 
quantum physics which has been tn the process of development 
since Planck’s introduction of energy quanta in 1900. BoHR 
described the situation in an address on ‘“‘ Atomic Theory and 
Mechanics’’ (appearing in Nature, 116, p. 845 (1925)) in the 
words: ‘‘From these results it seems to follow that, in the 
general problem of the quantum theory, one is faced not with 
a modification of the mechanical and electrodynamical theories 
describable in terms of the usual physical concepts, but with 
an essential failure of the pictures in space and time on which 
the description of natural phenomena has hitherto been based.”’ 
The rupture which led to a new stage of the theory was made 
by HEIsENBERG, who replaced Bohr’s negative prophecy by a 
positive guiding principle. 
The foundations of the new quantum physics, or at least 


its more important theoretical aspects, are to be treated in this 
XIX 
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book. For supplementary references on the physical side, 
which are urgently required, I name above all the fourth edition 
of SOMMERFELD’s well-known “ Atombau und Spektrallinien ”’ 
(Braunschweig, 1924), or the English translation ‘‘ Atomic 
Structure and Spectral Lines’’ (London, 1923) of the third 
edition, together with the recent (1929) ‘‘ Wellenmechanischer 
Erganzungsband ”’ or its English translation ‘‘ Wave Mechanics ”’ 
(1930). An equivalent original English book is that of Ruark 
AND Urey, ‘‘ Atoms, Molecules and Quanta ’’ (New York, 1930), 
which appears in the “ International Series in Physics,’ edited 
by RIcHTMEYER. I should also recommend GER LAcuH’s short 
but valuable survey ‘‘ Fxperimentelle Grundlagen der Quanten- 
theorie’’ (Braunschweig, 1921). The spectroscopic data, pre- 
sented in accordance with the new quantum theory, together 
with complete references to the literature, are given in the 
following three volumes of the series ‘‘Struktur der Materie,”’ 
edited by Born AND FRANCK :— 

F. Hunp, ‘‘Linienspektren und periodisches System der 
Elemente ’’ (1927); 

E. Back anp A. Lanpk, ‘‘Zeemaneffekt und Multiplett- 
struktur der Spektrallinien’’ (1925) ; 

W. Grotrian, ‘‘Graphische Darstellung der Spektren von 
Atomen und Jonen mit ein, zwei und drei Valenzelektronen ”’ 
(1928). 

The spectroscopic aspects of the subject are also discussed 
in PAULING AND GoupsmitT’s recent ‘‘The Structure of Line 
Spectra ’’ (1930), which also appears in the “ International 
Series in Physics.”’ 

The development of quantum theory has only been made 
possible by the enormous refinement of experimental technique, 
which has given us an almost direct insight into atomic 
processes. If in the following little is said concerning the 
experimental facts, it should not be attributed to the mathe- 
matical haughtiness of the author; to report on these things 
lies outside his field. Allow me to express now, once and for 
all, my deep respect for the work of the experimenter and for 
his fight to wring significant facts from an inflexible Nature, 
who says so distinctly ‘‘No” and so indistinctly ‘“‘ Yes”’ to 
our theories, 
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Our generation is witness to a development of physical 
knowledge such as has not been seen since the days of KEPLER, 
GALILEO AND Newton, and mathematics has scarcely ever 
experienced such a stormy epoch. Mathematical thought 
removes the spirit from its wor'dly haunts to solitude and 
renounces the unveiling of the secrets of Nature. But as 
recompense, mathematics is less bound to the course of worldly 
events than physics. While the quantum theory can be traced 
back only as far as 1900, the origin of the theory of groups 
is lost in a past scarcely accessible to history; the earliest 
works of art show that the symmetry groups of plane figures 
were even then already known, although the theory of these 
was only given definite form in the latter part of the eighteenth 
and in the nineteenth centuries. . KLrin considered the 
group concept as most characteristic of nineteenth century 
mathematics. Until the present, its most important application 
to natural science lay in the description of the symmetry of 
crystals, but it has recently been recognized that group theory 
is of fundamental importance for quantum physics; it here 
reveals the essential features which are not contingent on a 
special form of the dynamical laws nor on special assumptions 
concerning the forces involved. We may well expect that it 1s 
just this part of quantum physics which is most certar of a 
lasting place. Two groups, the group of rotations in 3-dimen- 
sional space and the permutation group, play here the principal 
role, for the laws governing the possible electronic configurations 
grouped about the stationary nucleus of an atom or an ion are 
spherically symmetric with respect to the nucleus, and since the 
various electrons of which the atom or ion 1s composed are 
identical, these possible configurations are invariant under a 
permutation of the individual electrons. The investigation of 
groups first becomes a connected and complete theory in the 
theory of the representation of groups by linear transformations, 
and it is exactly this mathematically most important part 
which is necessary for an adequate description of the quantum 
mechanical relations. lll quantum numbers, with the exception 
of the so-called principal quantum number, are indices character- 
1etg representations of groups. 
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This book, which is to set forth the connection between groups 
and quanta, consists of five chapters. The first of these is 
concerned with unitary geometry. It is somewhat distressing 
that the theory of linear algebras must again and again be 
developed from the beginning, for the fundamental concepts 
of this branch of mathematics crop up everywhere in mathe- 
matics and physics, and a knowledge of them should be as 
widely disseminated as the elements of differential calculus. 
In this chapter many details will be introduced with an eye 
to future use in the applications; it 1s to be hoped that in 
spite of this the simple thread of the argument has remained 
plainly visible. Chapter II is devoted to preparation on the 
physical side; only that has been given which seemed to me 
indispensable for an understanding of the meaning and methods 
of quantum theory. A multitude of physical phenomena, which 
have already been dealt with by quantum theory, have been 
omitted. Chapter III develops the clementary portions of the 
theory of representations of groups and Chapter IV applies them 
to quantum physics. Thus mathematics and physics alternate 
in the first four chapters, but in Chapter V the two are fused 
together, showing how completely the mathematical theory 1s 
adapted to the requirements of quantum physics. In this last 
chapter the permutation group and its representations, together 
with the groups of lincar transformations in an affine or unitary 
space of an arbitary number of dimensions, will be subjected to 
a thorough going study. 


THE THEORY OF GROUPS AND 
QUANTUM MECHANICS 


CHAPTER I 
UNITARY GEOMETRY 
§14. The a-dimensional Vector Space 


HE mathematical field of operation of quantum mechanics, 

as well as of the theory of the representations of groups, 

is the multi-dimensional affine or unitary space. The _ 
axiomatic method of developing the geometry of such a space 
is no doubt the most appropriate, but for the sake of clearness 
I shall at first proceed along purely algebraic lines. I begin 
with the explanation that a vector x in the n-dimensional 
linear space R = R,, is a set of n ordered numbers (%,, %2 + * *, Xn); 
vector analysis 1s the calculus of such ordered sets. The two 
fundamental operations of the vector calculus are the multiplica- 
tion of a vector x by a number a and the addilion of two vectors x 
andy. On introducing the notation 


t= (%1, Xa, °° %y Xn), )= (Va, ey ee Vn) 


these operations are defined by the equations 


ag aes (ax, AXe, ° °°, aX), E 5 y ae (%4 i Vi) %2 oT Ya °° °y 
Xn + Yn)- 


The fundamental rules governing these operations of multiplica- 
tion by a number and addition are given in the following table 
of axioms, in which small German letters denote arbitrary 
vectors and small Latin letters arbitrary numbers : 


(a) Addition. 


l.a+b=b-+ a (commutative law). 

2. (a+ b) +c=a-+ (b+ ¢) (associative law). 

3. a and c being any two vectors, there exists one and only one 
. vector ¢ for whicha+x=c. It is called the difference ¢c — a of 
cand a (possibility of subtraction). 
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(8) Multiplication. 


1. (a + b)x = (az) + (br) (first distributive law). 
2. cap == (ab)x (associative law). 
: ] 


t= 
a(t + ») ) == (ax) + (ah) (second distributive law). 
The existence of a vector 0 = (0, 0, - + *, 0) with the property 
r+ 0=0+2=2 


need not be postulated separately as it follows from the axioms. 

Affine vector geometry concerns itself entirely with concepts 
which are defined in terms of the two fundamental operations 
with which the axioms («) and (f) are concerned ; we mention 
a few of the most important. A number of vectors Q,, a9, ° °°, Q, 
are said to be linearly independent if there exists between them 
no homogeneous linear relation 


. C40) + C90, + °° * + C,0, = O 
except the trivial one with coefficients 


¢+,.=0, = 90, 3 ¢, = 0. 


h such vectors are said to span an h-dimensional (linear) sub= 
space §’ consisting of all vectors of the form 


E= &,0,; + fea, +> > > + Ena, 1.1) 


where the €’s are arbitrary numbers. It follows from the 
fundamental theorem on homogeneous linear equations that 
there exists a non-trivial homogeneous relation between any 
h-+ 1 vectors of R’. The dimensionality h of R’ can therefore 
be characterized independently of the basis: every h + 1 vectors 
in ’ are linearly dependent, but there exist in it h linearly 
independent vectors. Any such system of h independent 
vectors Qj, A, ° * *, a, 1n R’ can be used as a co-ordinate system 
or basis in ®t’ ; the coefficients &,, &,° + +, €, in the representation 
(1.1) are then said to be the components of x in the co-ordinate 
system (Qj, M2, ° * *, Qa). 
The entire space ® is »-dimensional, and the vectors 
e, = (1, 0, 0, ~» +, 0), 
ie (0, I, 4. . 0), (1.2) 
= (0, 0, 0, see ]) 
define a co-ordinate system in it in which the components of a 
vector 
E oar (4, 4s ° °°, xy) 
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agree with the “ absolute components ”’ z, : 
Uo yy + elo tte t+ Ten. 


From the standpoint of affine geometry, however, the ‘‘ absolute 
co-ordinate system ’’ (1.2) has no’preference over any other which 
consists of m independent vectors of . We now add to the 
previous axioms, which did not concern themselves with the 
dimensionality n, the following dimensionality axiom : 

(y) The maximum number of linearly independent vectors in R 
isn. 

These axioms (a), (8), and (y) suffice for a complete formula- 
tion of vector calculus, for if €,, €2,* * *, €, are any independent 
vectors and x 1s any other vector there must necessarily exist 
a linear dependence 


at + aye; + ae, + °° + + ae, = 0 


between them. Since not all the coefficients may vanish we 
must in particular have a +0, and consequently any vector r 
can be expressed as a linear combination 


Uo Xl + Xlke te + Xnln (1.3) 


of the ‘‘ fundamental vectors ’’ e@,, €2,° * *, €,. We specify x by 
the set (x1, X%2,° * *, X,) of components in this co-ordinate system. 
In accordance with axioms (a) and (f) for addition and multi- 
plication we then have for any two vectors (1.3) and y 


ay=(ax,)ey + (ae (ax n)En, tt+H)=(xy+yi)ei+ en (tntVnlen 


and we arrive at the definitions from which we started. The 
only—but important—difference between the arithmetic and 
the axiomatic treatment is that in the former the absolute co- 
ordinate system (1.2) is given the preference over any other, 
whereas in the latter treatment no such distinction is made. 

Given any system of vectors, all vectors ¢ which are obtained, 

s (1.1), by linear combinations of a finite number of vectors 
Q,, Qe, * * *, a, of the system constitute a (linear) sub-space—the 
sub-space ‘‘ spanned "’ by the vectors a. 

R is said to be decomposed or reduced into two linear sub- 
spaces f’, RK’ (R= R' + MR’) if an arbitrary vector ¢ can be 
expressed uniquely as the sum of a vector r’ of Rt’ and a vector 
r’’ of R’’. A co-ordinate system in ’ and a co-ordinate system 
in R” constitute together a co-ordinate system for the entire 
space ; this co-ordinate system in ®t is ‘‘ adapted’’ to the 
decomposition R’ + R”. The sum n’ +n” of the dimension- 
alities of R’ and RR” is equal to m, the dimensionality of R. 
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Conversely, if the sub-spaces §’, R’’ have no vector except 0 
in common, and if the sum of their dimensionalities is , then 
KR a Re’ +. RK. 

§’ being an n‘dimensional sub-space, two vectors x and by are 
said to be congruent modulo — : . 


¢ = h (mod. ®)), 


if their difference lies in §’. Congruence satisfies the axioms 
postulated of any relation of equality : every vector is congruent 
to itself; if r = 9 (mod. f’) then » =z (mod. ®’); ifr =yh 
(mod. §’) and 9 =%(mod. §’), then x =3 (mod. ff’). It is 
therefore permissible to consider vectors which are congruent 
mod. §’ as differing in no wise from one another; by this ab- 
straction, which we call projection with respect to R’, the 
n-dimensional space ®t gives rise to an (nm — n’)-dimensional 
space ®. is also a vector space, for from 


ti =e, 0, = Ye (mod. #’) 
follow the relations 


at, = ate, Tr t+ Yi = La t+ Yeo (mod. R’). 


The operations of multiplication by a number and addition can 
therefore be considered ones which operate directly on the 
vectors r of Rt. All vectors x of R which are congruent mod. R’ 
give rise to the same vector x of ft. If R’ is one-dimensional 
and is spanned by e the above process is the familiar one of 
parallel projection in the direction of e; it is not necessary to 
give an (m — 1)-dimensional sub-space of on to which the 
projection is made. 

If a is a non-null vector, all vectors r which arise by multi- 
plying a by a number are said to lie on the same ray asa. Two 
non-null vectors determine the same ray when, and only when, 
one is a multiple of the other. In a given co-ordinate system 
the vector a is characterized by its components a), dy, °° *, a, 
whereas the ray a is characterized by their ratios a,: a@g:° °°: ay; 
these ratios have meaning only when the components of a do 
not all vanish, 1.e. only when a =+ 0. 

The transition from one co-ordinate system e, to another e,’ is 
accomplished by expressing the new co-ordinate vectors e,’ in 
terms of the old: 


. n 
/ 
C= ati e;. 
t= 
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If x,, x,{ are the components of an arbitrary vector r in the old 
and in the new co-ordinate systems, respectively, then 


c= 24, ey = A Xue 
t 


from which the law of transformation 


n 


Hy = Ly XR (1.4) 
k=1 


follows. The requirement that the co-ordinate vectors e,’ also 
be linearly independent is expressed arithmetically by the non- 
vanishing of the determinant of the coefficients a,,. The com- 
ponents of vectors fz, ),: - - in ® undergo the same transformation 
on transition to the new co-ordinate system e,’ and are said to 
transform cogredtently. 


§2. Linear Correspondences. Matrix Calculus 


The formula (1.4) can, however, be otherwise interpreted ; 
it is the expression of a Jinear or affine correspondence or 
mapping of the space ® on itself. But for this purpose it 
will be found more convenient to interchange the roles of the 
accented and the unaccented co-ordinates. On employing a 
definite co-ordinate system e,;, the equation 


n 
are = d) Qk Nk (2.1) 

‘ k=1 
associates with an arbitrary vector x with components 4, a vector 
t’ with components x,’.. This correspondence A:r > Yr’ of Ron 
itself can be characterized as linear by the two assertions: if 
t, ) go over into x’, 9’, then ax goes over into ay’ and ¢ + y into 
r’-+ ’. Linear correspondences therefore leave all affine rela- 
tions unaltered; hence their prominence in the theory of affine 
geometry. In order to show that these two conditions fully 
determine the linear correspondence (2.1), consider the following : 
if a correspondence A which satisfies these conditions sends the 

fundamental vector e, over into 


Ce, = 21x e; (2.2) 
then, in consequence of the above requirements, 
ESM ter tt iHen 
goes over into 
go 4% Oy to + + in On. 
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On substituting (2.2) in this equation we see that the new vector 
t’ has in the co-ordinate system e, the components x,’ obtained 
from the components x, of tr by means of (2.1). It has become 
customary in quantum physics to call the linear correspondences 
of a vector space ® operators which operate on the arbitrary 
vector r of R. 

Let A, B be two linear correspondences, the first of which 
sends the arbitrary vector x over into y’ = Ar, while the second 
sends ry’ into y’’ = Br’ = B(Ar). The resultant correspondence 
C, which carries x directly into r’’, is also linear and is denoted 
by (BA) (to be read from right to left !) : 


(BA)t = B(Ar). 


This ‘‘ multiplication ’’ satisfies laws which are similar to those 
of multiplication of ordinary numbers; in particular, the as- 
sociative law 

C(BA) = (CB)A 


is here valid, but the commutative law is not—in general 
AB =+ BA. The‘ 1” in this domain, which we here denote by 
1, is the identity, i.e. that correspondence which associates every 
vector f with itself: ry. Hence 


AL=1A= A. 


The correspondence A is then and only then reversible in case 
it is non-degenerate, 1.e. if it carries no non-vanishing vector into 
the vector 0, or if distinct vectors are always carried over into 
distinct ones. The algebraic condition for this is the non- 
vanishing of the determinant | ix| = det A; there then exists 
the inverse correspondence A™?: 


AA = 47 A Ssh, 
The multiplication theorem for determinants states that 
det (BA) = det B- det A. 


6 


Not only can we ‘ multiply ’’ two correspondences, we can 
also ‘‘add’’ them. This concept of addition arises quite natur- 
ally: if the arbitrary vector x is sent over into x,’ by A and-into 
ft. by B, then that correspondence which sends fg into x,’ + 2X2’ is 
also linear and is denoted by A + B: 


(A + Bly = Ag + Be. 
We may also introduce multiplication by an arbitrary number 
a: aA is that correspondence which sends ¢ into a(Ar). Addition 
and multiplication by a number obey the same laws as the 
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analogous operations on vectors. Addition is commutative, 
and has as its inverse subtraction. The réle of 0 is played by 
the correspondence 0 which transforms every vector x into the 
vector 0. Addition obeys the distributive law with respect to 
multiplication : 


(A+ B)C = AC+ BC, | C(A+ B)=CA+ CB, 
(aA)C = a(AC), C(aA) = a(CA). 

Before proceeding to the arithmetical expression of these 
operations in a given co-ordinate system, we consider another 
natural generalization. We can map an m-dimensional vector 
space ®t linearly on an »-dimensional space ©; this is accom- 
plished when with each vector zr of ft a vector t) of © is associated 
in such a way r > 9 that from x, — hy, Lz > Ye it follows that 


ay, > ay, %1 + le > 91 + Ye 
Such a correspondence A:z->¥ is expressed by equations of 
the form 
iS ae PiDns Xt (f = 12°, n) (2.3) 


where x,, * * *, ¥ are the components of f£ in a given co-ordinate 
system in the space ® and y,, °° +, ¥, have the corresponding 
interpretation in ©. With this correspondence A there is 
associated the matrix 


Ay, Ay2- - » Aim 
: Aq, Gon - « - Gam 
Any Ang - - nm 


with m rows and m columns, and which we also denote by 
the same letter A. The first index indicates the row and the 
second the column to which a,;, belongs. We can also add corre- 
spondences of the same space §t on the same space ©. Addition 
and multiplication by a number ts accomplished on matrices by 
subjecting their 2 +m components to these operations: if 
A= |[4e: || and B= || d;, || 
then 
aA = |la- a,; ||, A+ B= || ay; + ,: |]. 

If we have a third (p-dimensional) vector space Y, the consec- 
utive application of the correspondences A: x — y of on © and 
B:y > 30f Gon gives rise to the correspondence C = BA: — 3 
of R on TY. This composition is expressed in terms of matrix 


components by the law 
— $= 1,2,% 9%, p 
C1 = Zo utes ic i 5 ae a (2.4) 
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B has p rows and n columns and A rows and m columns; the 
composition of matrices is possible when the first factor B has 
the same number of columns as the second factor 4 has rows. 
The component or element c,;, which is found at the intersection 
of the / row and the 7 column, is formed in accordance with 
(2.4) from the components in the J row of B and the 7 column 
of A. An important special case is that in which & is the same 
space as R; A is then a correspondence of R on G, B of SG on R. 
Already here concepts of the theory of groups play an important 


O---0 


role; on beginning Chapter III, which deals with the theory of 
groups, the reader should return to the matter here discussed 
as an illustration. 

The matrix calculus allows us to express the formule for 
a linear correspondence, such as (2.3), in an abbreviated form. 
We do this by denoting by x that matrix whose only column 
consists of the vector components %, %2, °° *, Xj similarly 
for y. In accordance with the rule (2.4) for the composition of 
matrices, equations (2.3) can be written 


y = Ax, (2.5) 


LINEAR CORRESPONDENCES 9 


This form is particularly useful in examining the effect on the 
matrix A of a linear correspondence of a space ft on a space © 
when the original co-ordinate systems are replaced by new ones. 
If this change of co-ordinates is effected by the transformations 
= DV syX;. Or X= Sx inh, 
j 
VYe= Dtenvn or y= Ty’ ing, 
h 
then from (2.5) 


Ty’ = ASx’ or yp’ = (TTAS)x’. 


FIG. 2. 


The same correspondence in the new co-ordinates is therefore 
expressed by the matrix 


MPAs. (2.6) 


Let us now return to the linear correspondence A of a space 
R on to itself. If R’ is a linear n’-dimensional sub-space of K* 
we say that A leaves R invariant if it carrics any vector of WH, 
over into a vector of Rt’. If the co-ordinate system is so chosen 
that the first »’ fundamental vectors lic in RR’, the matrix of 
a correspondence which leaves §’ invariant will assume the 
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form given by Fig. 1. All elements in the rectangle of n’ columns 
and n — n’ rows denoted by zeros in Fig. 1, vanish. A contains 
a correspondence of §R’ on to itself and at the same time a corre- 
spondence of the space §, arising by projecting R with respect 
to R’, on to itself. The matrices of these correspondences con- 
sist in the shaded squares. If 8 is decomposed into R, + R, 
(ny + N.= n), and if the correspondence A leaves both sub- 
spaces $f, and §t, invariant, then A is completely reduced 
into a correspondence of §, on itself and a correspondence of 
Ft, on to itself. If the co-ordinate system is adapted to the 
decomposition Rt, + Ji., the matrix A is completely reduced into 
two square matrices arranged along the principal diagonal as 
in Fig. 2. The unshaded rectangles are empty—the elements 
situated in these portions are all zero. 

Let the n-dimensional linear space be decomposed into 
sub-spaces Rt, + Mt, -+ °° +, Rx having the dimensionality a; 7 is 


then equal to the sum 7,-+ ,+ :° +. Any vector yz can then be 
written uniquely as the sum of components x, +2, + +: + which 
lie in the sub-spaces f,, ¥., °° °. The association ~ — Ya Is 


a linear correspondence Ea of ® on to fa. Given a correspond- 
ence A:r— vr’ of St on to itself, we consider that linear corre- 
spondence [A]ag which carries an arbitrary vector z of tg, over 
into the component fa’ in Ra of x’. We call [Alag the portion of 
A in which Rta intersects Rg. This terminology arises from the 
matrix representation of A; on adapting the co-ordinate system 
to the decomposition ®, + R, +--+ the set of variables x;, or 
rather their indices 1 which number the rows and columns of 
the matrix, is broken up into segments of lengths na (a = 1, 2,-°-°). 
The matrix A is thereby divided into the single rectangles 
[A]ag in which the «" set of rows intersects the 6 set of columns, 
and which consist of na +g elements. 

If A is the matrix of a correspondence of R on to itself in 
a given co-ordinate system, and JA’ its matrix in a co-ordinate 
system obtained from the first by means of the reversible 
transformation S, then in accordance with (2.6) 


A’ = SOAS. (2.77) 


The search for an invariantive characterization of correspondences 
may be formulated algebraically: to find expressions which 
are so formed from the components of an arbitrary matrix that 
they assume the same value for equivalent matrices, 1.e. for 
matrices A, A’ between which a relation (2.7) exists. The way 
in which this can be accomplished is indicated by the related 
problem of finding a vector x +0 which is transformed into 
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multiple Ar of itself under the influence of A. The column x 
f the components of x must then satisfy the equation 

Ax = Ax, or (Al — A)x = 0. 

3ut n linear homogeneous equations in m unknowns have a 
on-vanishing solution only if their determinant vanishes; the 
iultiplier A is therefore necessarily a root of the “ characteristic 
volynomial ”” 


f(A) = det (Al — A) (2.8) 
f A. This polynomial is an invariant in the above sense, for 
rom (2.7) or SA’ = AS it follows that 
S(A1 — A’) = (AL — A)S, 


vhence by the theorem concerning the multiplication of deter- 
ninants 


det S- det (Al — A’) = det (Al — A) - det S. 


since the determinant of the reversible transformation S cannot 
ranish, we can divide by it and obtain the required identity 


(Al — A’| = |A1— Al. 
[he characteristic polynomial is of degree 7 in A: 
fA) = AW — sw IH eS, 

vhose coefficients, certain integral functions of the elements 
tj,, are invariants of the correspondence A. The “norm” s,, 
s merely the determinant of A. The first coefficient s,, the 
‘race 

$y = Ay + Gog + °° * + Gan = tra (2.9) 
s of more importance, as it depends linearly on the a,, : 

tr(A, -{- Ag) = trA, -|- trA,. 


If A is a linear correspondence of the m-dimensional vector 
space ® on the n-dimensional space ©, and B is conversely a 
linear correspondence of © on WR, then we can build the corre- 
spondences BA of KR on to itself and AB of © on to itself. These 
[wo correspondences have the same trace 


tr(BA) = tr(AB) (2.10) 


or, in accordance with the rule of composition (2.4) and the 
definition (2.9) we have 


tr(B A) — 2,0 Any. tr( AB) ae 2445 Oy 
i,k i,k 
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where z runs from 1 to m and k from 1 to n. The special case 
in which A and B are both correspondences of R on to itself 
naturally deserves particular consideration. 


§3. The Dual Vector Space 
A function L(g) of the arbitrary vector x of the form 


y%y Keg tees + Oty (3.1) 


is called a linear form. This concept is invariant in the sense of 
affine geometry: it can be defined by means of the functional 
properties 


Log) = a+ L(x), Lie + 9) = Liz) + L(y). 


It is obvious that the expression (3.1) has these properties, and 
conversely, on introducing a co-ordinate system e; and setting 
r= 2'x,¢,, it follows that 


L(x) = PEF L(e;) = D, Xi, Lg = L(e,). 


On going over to another co-ordinate system such that the 
components x, of an arbitrary vector y undergo the transforma- 
tion (1.4), the linear form becomes 


Oi == Lax, 


the coefficients a,’ of which are related to the original «; by the 
equations 
on = 2 dix * Oye 
v 


The coefficients a; of a linear form are said to transform contra 
grediently to the variables x;. 

It is, however, not necessary to consider the «; as constants 
and the x, as variables. When the «,; do not all vanish the equa- 
tion Lig) = 0 defines a “ plane,” i.e. an (n — 1)-dimensional 
sub-space ; a vector £ lies in the plane if its components satisfy 
this equation. But on the other hand we can ask for the equation 
of all planes which pass through a given non-vanishing vector x° ; 
the x, = x,° are then constants and the «, variables. It is there- 
fore most appropriate to consider the two sets (x, %2,° * *, Xn), 
(a1, %2, °° *, &,) in parallel. 

We therefore introduce in addition to the space Jt a second 
n-dimensional vector space, the dual space P. From the com- 
ponents (&,, &, ° °°, &,) of a vector € of P and a vector 
(%1, %2,° * *, X,) of Ji we can construct the inner or scalar product 


EX + Sexe +e t+ Ente (3.2) 
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This product has, by definition, an invariantive significance, for 
when & 1s referred to a new co-ordinate system by means of 
a transformation of the x; the variables &; of the dual space P 
undergo the contragredient transformation. This dual space is 
in fact introduced in order to enable us to associate a contra- 
gredient transformation with each one-to-one transformation. 
To repeat, two linear reversible transformations 


x= Ax’, E= AL’ (3.3) 


are contragredient with respect to each other if they leave (3.2) 
unaltered : 


E,X, + E,X9 + caeaeas +- ee E,/%y + E,'X%_ + mae. +- eave. (3.4) 


A vector ¢ of ® and a vector € of P are said to be in involution 
when their product (3.2) vanishes. A ray in Rt determines a 
plane in P, i.e. the plane consisting of the vectors which are in 
involution with the given ray, and conversely. Duality is 
a reciprocal relationship.f 


The dual or transposed matrix A* of a matrix A = |la,,|| 
is obtained by interchanging the rows and columns of A. 
A* = |laj|| is therefore defined by aj, = a,;, and has m rows 


and ~ columns. We shall always employ the asterisk to in- 
dicate this process. And what is its geometrical interpretation ? 
Let ® be an m-dimensional, © an n-dimensional, vector space ; 
A: >) a linear correspondence of R on G, specified in terms 
of given co-ordinate systems in ® and © by the matrix A: 


Ve = Ln %, 
and let P, 2 be the dual spaces. The product 
2% Ve a * Oni NX = LE; %)), 


where 7 1s an arbitrary vector of 2’ with components 7,;, has then 
an invariantive significance. A bilinear form which depends 
linearly on a vector y of 2S and a vector g of R is therefore in- 
variantively associated with a linear correspondence of ® on ©, 
and conversely. This gives rise, as the expression of the bi- 
linear form given in parentheses shows, to a correspondence 


n> é: c= Lani 


of 2’on P, i.e. the dual A* of A. The reciprocal relation existing 
between the correspondence A and its dual A* may be expressed 


t In the theory of relativity it is usual to call vectors in and P contra- 
vartant and covariant vectors, respectively. 
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as follows: if xis an arbitrary vector in ® and 7 is an arbitrary 
vector in 2, then the product of the vectors Ay and 7 1s equal 


to the product of y and A*n. The dual correspondences obey 
the linear laws 


(A, + A,)* = A,* + A,*, (aA)* = a> A*. 


If A is a correspondence of & on © and B a correspondence of 
© on &, then since 


(BA)* = A*B* (3.5) 


BA maps ® linearly on Z, and A*B* maps the dual space T 
of Y on the dual P of MR. 

We have agreed once and for all to consider the set 
X1, Xe, ° * *, X, Of components of a vector r as a column; the 
inner product of the vector x in Rt with the vector € in P can 
therefore be written in matrix notation as &*x or x*& The 
transformations (3.3), from the first of which it follows that 
x* = x'*A*, are consequently contragredient to one another if 


A*A=1 or A= (A*)-} (3.6) 


and we have arrived at an explicit expression for the contra- 
gredient transformation. 

Let ’ be an n’-dimensional sub-space of R= R,. All 
vectors of P which are in involution with the totality of vectors 
of ft’ obviously constitute, in consequence of the simplest 
theorems on linear homogeneous equations, an (m — n‘)-dimen- 
sional sub-space P’ of P. And from this we are led immediately 
to the result that 2f a correspondence A of Mt on itself leaves the 
sub-space Rt’ invariant, then the dual correspondence A* of P on 
itself leaves the associated sub-space P" invariant. 

Let %& be decomposed into two or more sub-spaces 
R, + R, +: +--+ of dimensionalities 2,, nv, - ++, and let the 
sub-space of P which consists of all vectors in involution with 
all vectors of R, + RM, -+- - + be denoted by P,, the dimension- 
ality of which 1s also ,. Defining P,, P; analogously, we arrive 
at the decomposition P = P,; + P,+:-: +, for the sum of a 
vector of P, a vector of P,, etc., can only be. zero when each 
of the individual summands vanishes. In order to prove this 
latter statement, we note that if the sum is 0 then the first 
summand belongs to P, as well as to P, + P3;+- ° :, Le. it is 
in involution with all the vectors of #, + 8, + °°: as well as 
with all those of ,, and is therefore in involution with all the 
vectors of #. But this is only possible if this first, and therefore 
any, summand Is zero. P, can be considered as the space dual 
to ij, for if y is an arbitrary vector in R, and y a vector in P 
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with components 7'*) in the various P,, then the product of 
4 and 7 is equal to the product of x and 7"), 

If a correspondence A of ft on itself leaves the n’-dimensional 
sub-space ft’ invariant, then the (x — n’)-dimensional sub-space 
P’ is invariant under the dual correspondence A* of P on itself. 
If R is decomposed into R, + R, ++ + + and if A leaves each 
of the sub-spaces ft, invariant, then A* leaves each of the sub- 
spaces P, invariant. If A is any correspondence in R and [A]., 
that portion in which ®, intersects Rs, then the portion [A*],, 
of A* in which Pg intersects P, is dual to [A]qg: 


[A] pa = [A] "ap. (3.7) 

[A]1g maps R, on RK, and [A*],, maps the dual space P, on Py. 

All these results are conceptually evident, but can be seen 

even more readily directly from the matrices on adapting the 
co-ordinate system to the decomposition ®, + R,+ °°: 


§4. Unitary Geometry and Hermitian Forms 


The metric is introduced into affine geometry by means of 
a new fundamental concept: the absolute magnitude of a vector. 
In Euclidean geometry the sum of the squares 


Ue ey + Xe ft + x, (4.1) 


of the components of a vector ¢ = (1, %2, °° *, ¥,) is taken as 
the square of its absolute value. The only co-ordinate systems 
which are then equally permissible are the Cartesian systems, 
in which the square of the absolute value of x is given by (4.1) 
in terms of the components x,; the range of values which the 
components may here assume ts taken as the continuum of all 
real numbers. But the content of the preceding paragraphs 
is not bound to this choice; the only requirement is, in fact, 
that the range of permissible values constitute a “ field’’ in 
which the four fundamental operations (excluding division by 
zero) can be performed. We shall hereafter consider the con- 
tinuum of all complex numbers as the range of values which our 
components may assume. The expression (4.1) loses its definite 
character in this domain; the sum of the squares can vanish 
without implying that each term is zero. It is therefore desirable 


to replace the quadratic form (4.1) by the ‘ unit Hermitian 
form ”’ 


HX Zep + ° + * + Ey, (4.2) 


where Z denotes the complex conjugate of a number x. The 
value zr? of (4.2) will be taken as the square of the absolute 
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magnitude of the vector ¢ = (%,, %2,°**, %,) and the correspond- 
ing bilinear form 


4 (9) = XV. + Xe tet + XAVn 

as the scalar product (rn) of the two vectors r and h = 
(V1, Ye, °° *, Yn) A co-ordinate system is said to be normal 
when the square of the absolute magnitude of a vector f£ is 
expressed in terms of its components x, in this co-ordinate 
system by (4.2). In a normal co-ordinate system e, these 
components are the scalar products 


x; = (€:t). (4.3) 
The transformations which lead from one normal co-ordinate 
system to another such, which therefore leave the form (4.2) 
invariant, are called unitary transformations.t 
The conditions which characterize unitary transformations 
are entirely analogous to those for orthogonal transformations, 
with which we are familiar from the elements of analytic geo- 
metry. Let x = Sx’ be such a transformation; under the 
influence of S the fundamental metric form (4.2) goes over into 
a'*S*Sx’. S is therefore unitary if and only if S*S =1; the 
fact that det S +0 follows immediately from this. Indeed, 
since a matrix S and its transposed S* have the same deter- 
minant, it follows that the determinant of a unitary transformation 
has the absolute value 1: \det S|*= 1. These conditions may 


be expressed by the assertion that S* is the matrix S~! reciprocal 


to S, and therefore not only S*S = 1 but also SS* =: 1. The 
first of these equations states that the sum of the squares of 
the absolute values of the elements of a column is | and that 
the sum of the mixed products 25 5iSre of two different columns 


(i +k) is 0; the second equation contains the same assertion 
for the elements of the rows. 

We carry over the terminology usual in Euclidean geometry. 
In particular, the vector 1s said to be perpendicular to x if 
the scalar product (ry) vanishes. In virtue of the symmetry law 


cen 


(yx) = (ry) 


perpendicularity is a reciprocal relationship. There exists no 
vector a, except a = 0, to which all vectors are perpendicular ; 
in fact, a = 0 is the only vector which is perpendicular to itself. 
Normal co-ordinate systems can be characterized by the fact 

+t The name ‘orthogonal ’’ has been used in the physical literature to 


denote these transformations, but in mathematics it is necessary to have 
different names for these two different concepts. 
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that for them the scalar products of the fundamental vectors 
e, among themselves are 
fl (t = k) 
(0 (2 4= R). 

On comparing the fundamental metric form (4.2) with (3.2) 
it is seen that the unitary space R can be characterized by the 
fact that its conjugate complex Rf coincides with its dual P, or 


(e, ey) = Oy = 


more precisely, that the conjugate complex ¢ of a vector x can 
at the same time be considered as its dual. We found that with 
a correspondence A of an m-dimensional unitary space ® on 
an n-dimensional © is associated in an invariant manner the 
correspondence A* of the dual space 2 on the dual P. As a 


consequence of the equation P = & for unitary spaces 
At =A 


is a correspondence of © on &; we call it the ‘ Hermitian 
conjugate of A." AA is a correspondence of ® on itself, 


AA of © on itself. A correspondence © which carries the 
general vector x over into x’ = Sr is unitary if it leaves the 
absolute magnitude of x unaltered: yx’* =: x3. Two configura- 
tions consisting of vectors, either of which can be obtained from 
the other by a unitary transformation, are congruent in unitary 
geometry ; i.e. unitary geometry 1s the theory of those relation- 
ships which are invariant under an arbitrary unitary transforma- 
tion. The characteristic property of such transformations is 
expressed in terms of the matrix calculus by either of the two 
equations 


SS=1, SS=1. 


Let §’ be an m-dimensional linear sub-space spanned by 
the linearly independent vectors Q), Qz, °° *, Qj. We consider 
a vector x as belonging to the sub-space R” if and only if it 1s 
perpendicular to ®’, i.e. to all the vectors of R'; such a vector 
must therefore satisfy the equations 


(a,x) = 0, (agx)=0, +--+, (a,2%) = 9. 


From these it follows that R’ is (#2 — m)-dimensional. The 
relation between ’ and RR” is a reciprocal one: every vector 
of RR” is perpendicular to every vector of ’ and conversely. 
We then have R= FR’ + RK". for if the sum xr’ + 2" of a vector 
r’ in ®’ and a vector x” in R’’ vanishes then zy’ = — fz" 1s a 
vector which belongs to both sub-spaces and is consequently 
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perpendicular to itself, and this can only occur if xy’ =0. A 
unitary correspondence which leaves §f’ invariant will also leave 
§t’’ invariant since the relation of perpendicularity will not be 
destroyed by such a transformation. Jn dealing with unitary 
correspondences or transformations 1t 18 therefore always possible 
to find an invariant sub-space R" associated with a given invariant 
sub-space R', such that R= R' +°R'". The previous remarks 
about projection suggest that here in the unitary geometry we 
identify the space generated by projecting & with respect to 
#’ with the sub-space #’’: we project on to the space Rt” per- 
pendicular to ®’. To this end we remark that among all vectors 
ain § which are congruent mod. Jt’ there is one (a) which lies 
in R’’; we then have 


(a°a) = a(a), (a + b) = (a) + (5). 


With an arbitrary linear correspondence A 


4) —> 1)’ = A !) ‘ Vie = 2, 4ikV (4.4) 
of on itself is, as we have seen, associated a bilinear form 
ou Fix oe Vk 


which depends linearly on a vector € in P and a vector 9 in &. 
In unitary space we can therefore associate the form 


A(t, )) = Aik Le Vey 


depending linearly on ) = (y,) and x = (&,), with the correspond- 
ence (4.4). It is in fact the scalar product of x and Ay. The 
special case in which 


A=A_ or A(y, rt) as A(t, y) Or ayy = Aix (4.5) 
bears the name of the French mathematician Hermite. The 
correspondence (4.4) is consequently Hermitian if the scalar 
product of x with Ay is the conjugate complex of the scalar 
product of 9 with Ay. On identifying y with r we obtain the 
‘* Hermitian form ”’ 


A(t) = A(t, t) = Lay, Z, 4, (4.6) 


i.e. the scalar product of y and Ax; in consequence of (4.5) its 
value is real. An Hermitian form or correspondence A is said 
to be non-degenerate if there exists no vector xr, except x = 0, 
whose transform Ay vanishes It is positive definite if the value 
of the form A(z) > 0 for all vectors r +0; a positive definite 
form is non-degenerate. 

The fundamental metric form (4.2) is one such positive 
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definite Hermitian form, the ‘ unit form,’’ the coefficients of 
which consist of the numbers 


Re aks 1 (2 = k) 
"= toler A) 
On introducing an arbitrary co-ordinate system a, (2 = 1, 2,+++, n) 
into the n-dimensional space, the absolute magnitude of an 
arbitrary vector 

E410, + X20, Tt + Han 
is given by 

Pa Dinh Xn, Lie = (A; Ay). 

The expression for x? is accordingly always a definite Hermitian 
form; conversely, any positive definite Hermitian form G(r) 
could be taken as the fundamental metric form. To show this 
we employ the associated Hermitian bilinear form G(z, y) to 
carry through the following procedure, which 1s patterned after 
the step-by-step construction of a Cartesian co-ordinate system. 
Choose any non-vanishing vector e,; since G(e,;) > 0 we may, 
on multiplying e, by an appropriate numerical factor, normalize 
it in accordance with the equation G(e,;) = 1. When the process 
of constructing a system of unitary-orthogonal vectors e, 


G(e,, Cx) = dix 


has been carried through m steps, 1 = 1, 2, °*+ +, m, the next 
step is accomplished by choosing a solution y= e,,4, of the 
m <n homogencous linear cquations G(e,, xr) = 0 for the 
unknown components of the vector y + 0 and normalizing it 
in accordance with the equation G(e,,,,) == 1. The procedure 
comes to an end after » steps; we then have nm vectors 
C1, @, °° *, €, of such a kind that 


G(t, t) = Ty X%y + Te X_ + aR al oe En Xn 


where 
E = yey + X%e@e + re a a One 


It follows from the equations themselves that ¢ can only vanish 
when all of its components x, vanish, and consequently the e, 
are linearly independent and constitute a co-ordinate system 
in &. 

The transition from affine to metric geometry can accordingly 
be accomplished by the introduction of the axiom: 

(5) The square of the absolute magnitude of a vector x is a real 
number x? which 1s a positive definite Hermitian form in the 
components of X. 

These last considerations are useful in another connection. 
If R' is a linear sub-space of Jt we can employ the construction 
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used above to find m vectors e@,, @:, °° *, @, 1n R’ which span RK’ 
and are mutually unitary-orthogonal in the sense of the equations 
(e,e,) = 5;,. By continuing the construction we can supplement 
these m fundamental vectors by mn -- m additional ones 
Cmit °° *) @n So that the two sets together form a co-ordinate 
system for the entire space 9. We can therefore adapt our 
normal co-ordinate system to the separation of 8’ out of ® or 
to the decomposition of R = KR’ + R” into two perpendicular 
sub-spaces. 

Since the correspondence A of ® on to itself is invariantively 
connected with the Hermitian form A in &, we may speak of 
the product BA of two Hermitian forms A, B in R, but this 
product is not in general Hermitian as 


BA = AB = AB. 


The trace of an Hermitian form or correspondence A is real. 
The positive definite expression 


tr (AA) = x ax? (4.7) 


is of particular importance. When & 1s decomposed into 
mutually perpendicular sub-spaces ta (« = 1, 2, +++) the section 
Aag of the correspondence or form A in which Wa intersects Rg 
is uniquely determined ; it is a correspondence of R, on Ra, 


and Ags, the Ba-section 5 A, is a correspondence of Qa on Rg. 
When the co-ordinate system is adapted to the decomposition 
of R we have 


tr (Aap Ae) = tr (Aue A ag) os > | axl? (4.8) 


where in the sum 7 runs through the «", & through the B™ set 
of indices. 

Any non-vanishing vector a determines a ray a which consists 
of all vectors of the form Aa, A being an arbitrary complex number. 
The generating vector a can be so normalized that its absolute 
value |a|= 1; this does not, however, determine a to within 
a change of sign, as in the real domain, as the normalization is 
unaltered on multiplying a by an arbitrary (complex) number ¢ 
of modulus 1. We shall call the totality of vectors of R the 
vector field R and the totality of rays the ray field R. Any 
non-degenerate linear correspondence A of the vector field ® 
on itself is at the same time a correspondence of the ray field 
R on itself, but this latter correspondence is unaltered by 
multiplication with any non-vanishing number. A_ unitary 
correspondence or transformation of the ray field on itself will 
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2 briefly referred to as a rotation. By the symbol S’ ~ S we 
iall mean that the two transformations S, S’ of the vector 
eld on itself differ only by a numerical factor ¢ of modulus 1: 
‘== €S, whence they both give rise to the same rotation of 
ie ray field. 


§5. Transformation to Principal Axes 


The fundamental theorem on Hermitian forms is that con- 
2rning the transformation to principal axes. We are here 
yncerned with the analogue of the familiar problem of finding 
1e principal axes of an ellipse or ellipsoid in the ordinary 
20ometry of two or three dimensions. We wish to find a normal 
)-ordinate system e, associated with a Hermitian form A(x) such 
atin addition to 


Eo X40 + Xgle + ° ses wee 
Yt = 2X, + TeX, ++ + + TX, (5.1) 
e also have 


A(t) = 404%, + GeFat, + + + OFX n | (5.2) 


1at is, A shall be brought into the normal form (5.2) by means 
f a unitary transformation. The real numbers a, @, * °°, & 
re called the characteristic numbers of the form A, and 
,@s,° °°, €, the corresponding characteristic vectors. 

To this end we first consider the correspondence x — x’ = AL 
nd seek those vectors xy =--0 which are transformed into 
\ultiples x’ = Ag of themselves by A. We then obtain the 
secular equation” 


fA) = det (AL — A) = 0 


yr the multipliers A. According to the fundamenta! theorem of 
zebra this equation certainly has a root A = a, ; corresponding 
) it a non-vanishing vector r = e, can be found which satisfies 
1e equation Ae, = «,¢,, and on multiplying this vector by an 
ppropriate numerical factor we may take it such that its modulus 
-unity. ¢@, can then be supplemented by 2 — 1 further vectors 
, °°, @, in such a way that these 2 vectors constitute a normal 
)-ordinate system. In these co-ordinates the formule 


Qe = Ae; = Jaxer 
k 
ir the correspondence A require, in accordance with the 


efinition of e,, that the coefficients ag, @3;, ° * *, @,, vanish and 
iat @,, == %, Because of the symmetry conditions a,; = dix, 
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Ayo, 433, °° *, 4, must also vanish. Hence in the new co-ordinates 
the matrix A assumes the form 


a, O 0 0 
QO Ayg og * * * Aon 
0 A30 433 ° Agn Ils (5 3) 


ee a ee, ee, ee ee, ee, ee ee ee) 


@) ane an3 eee 


and the Hermitian form becomes 
A(t) = 04,2, + A’(r) (5.3) 


where A’ is an Hermitian form containing only the n—1 variables 
Xo, %3,° °°, Xn. Repeating this process, or calling on the method 
of mathematical induction, we establish the validity of the 
fundamental theorem stated above. 

The characteristic polynomial of (5.2) is 


det (Al — A) = (A — a,)(A — a) > + + (A— o,). 


From this it follows that the characteristic numbers o,, 
%, ° * *, &,, including their multiplicity, are uniquely deter- 
mined by the Hermitian form A; their sum is the trace of A. 
What can we say concerning the characteristic vectors? Let 
a be a given real number; the vectors x which satisfy the equa- 
tion Ar = ar constitute a linear sub-space (a) of R, the 
characteristic space belonging to « When the normal 
co-ordinate system e, is so chosen that A is in the normal form, 
the equation Ar = af is, in terms of its components, 


OX, == AX; 


from which it follows that $t(«) is spanned by those vectors e, 
for which a, = a. If, for example, the three roots «,, &», «3 == « 
while all the others are different from a, the characteristic space 
F(a) is 3-dimensional. If none of the characteristic numbers 
a, is equal to a, R(«) consists only of the vector 0. This again 
characterizes the characteristic numbers, including their multi- 
plicity, in a way which is independent of the particular co- 
ordinate system chosen, and in addition it characterizes the 
corresponding sub-spaces ¥i(a). % is thus decomposed into the 
characteristic spaces R(a): R= YDR(a«); only a finite number 


of terms occurs in this sum, 1.e. those for which « is a character- 
istic number of A. A complete co-ordinate system @,, €s, °° ', ey 
for the entire space ® can be obtained by choosing a normal 
co-ordinate system in each non-null sub-space R(a). The 
normal form (5.2) is undisturbed on subjecting the variables 
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x, associated with the same characteristic number a, = « to an 
arbitrary unitary transformation. 


If, for example, « is a triple characteristic number 
Xy — Lo — Xs = & 


while the remaining «, + a, then x,¢, + x@, + %3¢, is the 
normal projection Y, of the vector x on R(a«) and 


E,{t) == DyX, + LoXq + LX 


is the scalar product of ya with itself. The equations (5.1), 
(5.2) may then be written in the invariant form 


r= LEE), A(t) = Jia E,({t). (5.4) 


R’ being a sub-space of R, any vector r can be uniquely 
broken up into zg’ + Y) where r’ lies in R’ and x, is perpendicular 
to #’. The ‘ orthogonal projection” r—>r’ = F’g is a linear 
correspondence which obviously has the property 


E’E' = E’, (5.5) 


for the projection of x’ on §’ is simply rz’ itself. Furthermore, 
the operator E’ is Hermitian, for the scalar product of 4) into r’ 
is equal to the scalar product of ’ into x’, where yy’ is the projection 
of y on R’. (The Hermitian form E’(z) is accordingly the square 
of the absolute value of x’.) We shall call Hermitian forms 
which satisfy equation (5.5) 1dempotent. 

When the sub-spaces §’. R’’ are orthogonal, the two corre- 
sponding projection operators EF’, E” satisfy the equations 


E'E"=0, E"E'=0, (5.6) 


for &’ (E"r) is the component of E’’r lying in the space R’ per- 
pendicular to E’’y. Idempotent operators which satisfy these 
equations are said to be independent. The second equation is, 
moreover, a consequence of the first, as may be seen on going 


over to the Hermitian conjugate: E”E’ = 0. If ® is decom- 
posed into several mutually orthogonal sub-spaces R’+R" + --- 
then 


rae F’rt+ BEUrtee (5.7) 


It is easily shown that the converses of all these assertions 
are also valid. If £’ isan idempotent operator and E” = 1 — E’, 
all vectors of the form E’r constitute a linear sub-space §’ and 
all vectors of the form E’’r a sub-space R”’. The equation 


~~ 


E'E" = E'E” = E'(1 — E’) =0 
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shows that the scalar product of a vector E’y in R and a vector 
Ey in R” is zero: *+E’E”y=0. The decomposition of a 
vector x into a component lying in ®’ and one perpendicular 
to R’ is accordingly expressed by 


r= Ey+ (l— EL’). 


If the two idempotent forms E’, E” satisfy the equation (5.6) 
then, as we have just seen, the two corresponding characteristic 
spaces ’, R’’ are mutually perpendicular. If the sum (5.7) 
consists of independent idempotent forms, then by the above 
the corresponding mutually perpendicular sub-spaces ft’, R”’ 
exhaust the entire space ff. 

The theorem on transformation to principal axes can accord- 
ingly be stated: An Hermitian form A associates with the real 
numbers a mutually independent idempotent Hermitian forms Ea 
such that 


l=, AS Dak. (5.8) 


Eis non-vainshing for only a finite number of values a. 
A correspondence A can be reiterated : 


AAS A 2PASAt ses 
and we can accordingly obtain polynomials 
f(A) = Col + CA + €,A* 4+ °° * +4 €,A" 


in A with numerical coefficients c. On reiterating (5.8) h — 1 
times 
A= Yolk, 
a 


whence for the general polynomial f 


f(A) = Xf(aE.. (5.9) 


The characteristic numbers of f(A) are therefore the values of 
the polynomial f(o) for the characteristic numbers « of A. This 
suggests defining the Hermitian form f(A), where f(a) is any 
ae function of the real variable a, by means of the equation 
9 

Given two Hermitian forms A, B, under what conditions can 
they be brought simultaneously into diagonal form, i.e. when is 
it possible to find a normal co-ordinate system in which 


AQ) = exit, + atety +--+ + ondatn 6,40) 
B(t) = BEX + Bobet, +++ + + Babar? 
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A necessary condition is that they commute: BA =: AB, for if 
A and B are in the normal form (5.10) BA as well as AB ts 
the diagonal matrix with elements B;a; =: «PB; This condition 
is also sufficient; to prove this, chaose a normal co-ordinate 
system in which A is already in normal form. The equation 
BA = AB requires that the matrix B = ||,,|| satisfy 


bk ap, == A; bis or (a, xe On) Dik == (), (5.11) 


We divide the indices 7, the fundamental vectors e; and the 
variables x, into classes by considering 2 and k to be of the same 
class if a; = o,. Equation (5.11) states that 0,0 when 
t and k belong to different classes. B is consequently decom- 
posed into smaller matrices B’, RB” aligned along the principal 
diagonal, corresponding to the way in which the «, are distri- 
buted in classes «’, a’, +++; the correspondence B consequently 
leaves each of the characteristic spaces }*(«’), R(a’’), °° + of A 
invariant. But we can then choose a normal co-ordinate 
system in each of these characteristic sub-spaces R(a) in such 
a way that the Hermitian correspondences B’, B” in them are 
referred to principal axes; the normal form of A 1s undisturbed 
by this procedure. 

This process can immediately be applied to any number of 
Hermitian forms: Ary number of Hermitian forms can be brought 
simultaneously into normal form if and only 1f they commute 
with one another. By a slight modification we can further 
extend this theorem to an arbitrary finite or infinite system X of 
Hermitian forms. This will be briefly discussed here, although 
in general the consideration of systems of forms or correspond- 
ence is postponed until Chap. III. Let the space ® be decom- 
posed into mutually perpendicular sub-spaces ’, MR’, + °° in 
such a way that each correspondence of the system 2 takes 
place in these sub-spaces; on adapting the co-ordinate system 
to this decomposition each Hermitian matrix A of 2 consists 
of sub-matrices A’, A”, - ++ aligned along the principal diagonal. 
If all the A’ are already multiples of the unit matrix 1 in ®’ 
and similarly for all A’”’, -- +, our goal is reached, for each corre- 
spondence A of the system then transforms ’ into itself and 
is a simple multiplication in it; similarly for RR’, +--+. But if 
this is not the case let A be a correspondence of the system 
which is not merely a multiplication in the sub-space Rt’. On 
transforming the constituent A’ of A to principal axes, R’ is 
decomposed into characteristic spaces R,’ + R,’ +--+ of A’, of 
which there are at least two. For any Hermitian matrix X 
of 2’ we have A’X’ = X’A’, from which it follows, as we saw 
above, that X’ transforms each of the sub-spaces R,’, Ry’, °° ° 
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into itself. The decomposition ft’ + R’’ +--+ can thus be 
further reduced to the decomposition (R,’ +R +: - -)+ 
R'’ + +++. Proceeding in this way we finally reach our goal 
after at most steps, proving: 

The Hermitian forms of any system & can be simultaneously 
referred to principal axes if they all commute with one another. 

The theory developed above for Hermitian correspondence 1s 
valid as 1t stands for unitary transformations. S being any unitary 
operator, a normal co-ordinate system e,; can be introduced 1n such 
a way that S carries each of the fundamental vectors e; over into 
a multiple o,e,; of 1tself. The characteristic numbers o, of S are 
numbers of modulus 1. In these co-ordinates the matrix of S 
is a diagonal matrix, the elements in the principal diagonal 
of which are the numbers o,,. 

The proof is quite analogous. We again start with the 
secular equation 


det (o1 — S) = 0 


and consider the roota,. There then exists a vector ¢, of modulus 
1 which is transformed into o,e, by the correspondence S.  Sup- 
plement e, with n — 1 further vectors ¢,, - + -, €, so that these x 
vectors form a normal co-ordinate system. In these co-ordinates 
the matrix ||s,,|| of the correspondence S: 


Se, = Ski Cr 
k 
is again of the form 
Sip = 93, Sy = = Sy = 9. 


Since S is unitary the sum of the squares of the moduli of these 
elements of the first column must be unity, whence \o,| =. 
Similarly the sum of the squares of the moduli of the elements 
in the first row must also be 1: 


Jo, |? ai Isao]? qi Pee Sin|? =1; 
but since |o,|? = 1 it follows that 
Syg = 1 = Sin = 0. 


The matrix S is now broken up into a 1-dimensional o, and 
an (n — 1)-dimensional S’ as in (5.3); the truth of the above 
theorem then follows immediately by induction. 

The further results can be obtained in exactly the same way 
as above for Hermitian forms. The characteristic numbers o,, 
including their multiplicity but not their order, are uniquely 
determined by S, and similarly for the corresponding sub-spaces. 
If we wish to find a linearly independent system of character- 
istic vectors, the fundamental vectors of each such sub-space 
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may be taken as forming a normal co-ordinate system. Finally, 
a finite or infinite set of unitary transformations can be simul- 
taneously reduced to normal form if and only if they commute 
among themselves. 


§ 6. Infinitesimal Unitary Transformations 


A rigid body in continuous motion about a fixed point O 
performs an infinitesimal rotation in each interval dr of time. 
Denoting by (dx,, dx., dx;) the infinitesimal displacement of 
that point of the rigid body which is at the point P(x,, %», %3) 
at the time 7, the equations of motion of the body must be of 
the form 


ax, 
x, = ae == = ares ik*k (6.1) 


in which the coefficients c,;, are constants, 1.e. independent 
of the particular point P under consideration. Employing a 
Cartesian co-ordinate system with O as origin, x,? + x,? + x,? 
must remain unchanged throughout the motion; this requires 
that 


7 Ba ne te 
aX OF. Lepr rp = 0, 
i CT 1k 


Since this equation must be satisfied identically in the x,, the 
matrix C == |lc;,|| which characterizes the motion must be antu- 
symmetric: ¢,; == —¢,,. Introducing the vector rt with origin 
at O and terminus at the point P, and the vector ¢ = (C93. C33, C12), 
equations (6.1) become 


the familiar fundamental formule for the kinematics of a rigid 
body. The square brackets denote the vector product and c 
the vectorial angular velocity, the absolute value and direction 
of which give the angular velocity and direction of the axis of 
rotation respectively. 

The continuous compounding of interest offers another 
example of an infinitesimal linear transformation. The interest 
rate being c, a real number, the increase in the capital x in time 
dr is xcdr. Radioactive disintegration is the same kind of a 
process with negative c. The capital x, considered as a function 
of the time, satisfies the equation 


Oe as CX (6.2) 
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and consequently increases exponentially with 7. If the prin- 
cipal has the value x, at time 7 = 0, it will have increased to 


4(7) = X%y* e* 


at time 7. To obtain an alternative solution we divide, as in 
the method of finite differences, the time interval 7 into a large 
number n of equal elements r/n; x will increase by xcr/n in 
each of these intervals and the capital x will accordingly be 
multiplied by (1 + cr/n)" at the end of time 7. The familiar 
definition 

ec? = lim € Ee =) (6.3) 

n—> n 
of the exponential function follows from a comparison of these 
two results. But we can also solve the differential equation 
(6.2) by the method of successive approximations. We take as 
the Of approximation the initial value x9: 4%9(7) = %. The 
(n + 1)st approximation is obtained from the n™ by substituting 
the latter in place of x on the right-hand side of (6.2) and 
integrating : 
Xnsi(T) = Xp + cf x,(t)de. 


0 
On carrying out this process we find 


n | 


C cr)" 
q(t) = Xof pe ee \, 
from which we obtain the familiar power series expansion 
sees cr (G3 a 
eo1eh + ort 6.4) 


for the exponential function. The convergence of (6.3) and 
(6.4) and the identity of their limits is rigorously proved by 
elementary analysis. 

These examples will assist in understanding the concept of an 
infinitesimal unitary transformation of the n-dimensional 
space Jt = ft,, which we now proceed to introduce. In order 
to avoid the use of infinitesimals we introduce a (purely fictitious) 
time 7 and think of the infinitesimal linear correspondence which 
carries the vector x over into x + dy as taking place in the time 
interval dr: 

dy ax, 


a = CX, cp == aucs Nv. 
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(For the sake of brevity we refer to this simply as ‘ the in- 

finitesimal transformation C.”) Since the transformation is 

unitary, on employing a normal co-ordinate system SZ, x, must 
i 


remain unchanged . 
YL; eee %, ee — 


On setting 
—- = Con X eee Cee 2: 
dr 2 tk ~ky de 2 kis 


the left-hand side of (6.5) reduces to the Hermitian form 


> (ix ae Cus); Xk 


s, 
and since 1t must vanish identically in the x; we must have 


Cine + Cy = 90, or the transformation C is anti-symmetric in 
the sense of the equation 


Cr=— Cy, C= —C, (6.6) 
In the real domain there exists no intimate relationship between 


symmetric and anti-symmetric matrices, but the situation 1s 
different in the complex domain. For on setting C = 1H (1 being 


the imaginary unit V -- 1) it follows from (6.6) that H satisfies 


the equation H = H, and C 1s consequently 2 times an Hermitian 
matrix. Jn an infinitesimal unitary rotation of a vector field the 


. ay. 
velocity Me 1s related toy by means of a correspondence whose matrix 
T 


1s 1 times an Hermitian matrix. The theorem on transformation 
of Hermitian forms to principal axes is accordingly the limiting 
case of an analogous theorem on unitary transformations. 
By repeated application of the infinitesimal unitary trans- 
formation 
dy = dr- Cy (6.7) 
we obtain after time 7 
E> U(r) = Ulrjy = er’e (6.8) 


where the exponential function e4 for a matrix A can be defined 


by either 
; A\* 
lim (1 + -) 


; h—>o 
or the power series 


Naturally 
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Accordingly U(r) runs through all the transformations of a 
1-parameter continuous group of unitary transformations gener- 
ated by the infinitesimal transformation C; the parameter 7 is 
additive on composition. The power series is obtained by the 
method of successive approximations; this method can also 
be applied to obtain a solution in the more general case in which 
the infinitesimal unitary transformation C is not the same for 
each time element dz, 1.e. in which C is a matrix C(r) depending 
on the timer. The solution of the equation 


ces 


= C(r)t 


ay 
4 


for this case is given by 
E(t.) = U(rery)£(73) ; 
the unitary transformation U(r.t,) which takes place in the 
time interval 7,, 7, obeys the law of composition 
Ua) = 0. (7372) U(rer}). 
If ry = 2, at time + = 0, the formule for the successive approx- 
imations f, (7) are 


Lo(t) = X03 Lrsi(t) = Lo + (CQer(t)ae } 


0 @) 
for U(r) = U(r 0) we obtain the infinite series J°U, (7) in which 
= 0 


Ug(r) =1; Unalt) == CQ) UiAY at. (6.9) 
0 
Written explicitly, 


U(r) = if ee J C(4)C(t) _a C(t,)dt, dtg> + + dt, 
(OS S45°°°St)S7) 


The proof of the convergence of this process is readily ob- 
tained with the aid of the quantity | A | associated with a matrix 
A = || a,, || by the equation 


| Aj? = tr (AA) = 2 Aix |. 


It follows from the well-known Schwarz inequality 


|a,b, + dgbg + +++ + 4,5, 
S (Ja,l? + +++ + a, l?)(1d,7%? ++ +--+ 1b,|?) (6.10) 


|A4+ B) S|Al+ [Bl 


that 


and that 
|AB| S| A] | BI. 
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The second inequality is obtained by applying (6.10) to the 
element 


Cie = Lage Ory 
of C= AB: 
fowls Ss Dd | ai,|* ° D | b,x |? 


and summing with respect toz and &. The first inequality may 
be stated in the form 
|fA(e) de| S J Ald) | ae. 

0 


0 
for integrals. The convergence of XU, (7) can now be established 


with the aid of these auxiliary results, for we can prove that 
under the assumption 


|C(t)| Se (0 St Sz) 
that 


L 

|Ui(7)| S Vin 

For this is certainly true for |! = 0, and the recursion formula 

(6.9) enables us to conclude that it holds for U,,, if it holds for 

U,. The convergence follows from this absolute convergence, 

for the absolute value of each component of the matrix A 1s 
certainly not greater than | A |. 

We have only gone into these matters to reassure the reader 
of the legitimacy of dealing with infinitesimal quantities of the 
kind met here. The only thing of importance for the following 
is the simple relation existing between infinitesimal unitary 
transformations and Hermitian forms, 


§ 7. Remarks on o-dimensional Space 


The unitary spaces which appear in quantum mechanics 
usually have an infinite number of dimensions. Such a space 
consists of all vectors 

t F, (X41, Xo,° ° :) 


whose components x; constitute an infinite sequence of numbers 
for which 


EP = Kx + XX ts’ 
converges. Within this domain addition and multiplication 


with numbers, as well as the construction of the scalar product 
of two vectors, are possible. All the axioms employed so far 
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are satisfied, with the exception of the dimensionality axiom y 
introduced in § 1. 

Since the vector components %,, %s, °° * constitute a de- 
numerable set, this ‘‘ Hilbert space ’’ has a denumerably infinite 
number of dimensions. But in addition to these, spaces of 
non-denumerably infinite dimensions may occur. Consider, for 
example, all continuous complex functions ¢(s) of a real variable 
s of period 27. Weneed not distinguish between two values of s 
which are congruent mod 27, 1.e. whose difference is an integral 
multiple of 27 ; it is consequently more convenient to consider p(s} 
as a function defined on the periphery of the unit circle than on the 
straight line. The various values of s at points on the circum- 
ference play the rdle of indices, the value f(s) at the point s being 
the component of the ‘“‘ vector #’’ with index s. The totality 
of such functions f(s) therefore constitute a linear ‘‘ function 
space’ of continuously infinite dimensions. Addition of these 
vectors and multiplication by a number have here the same 
interpretation as in the ordinary operations with functions. 
The square of the absolute value of the vector # is taken to be 


(Y, b) = [P(s)p(s)ds 


and the scalar product of two vectors ¢ and y# as 


($, #) = J F(s)y(s)ds. 


A set of functions 


(5), p2(5), 2 ey $x(5) 


constitutes a unitary-orthogonal system of vectors if 


($s(s)bx(s)ds = Oy. 


These vectors span an n-dimensional sub-space ®,, of the oo-di- 
mensional function space, 1.e. that sub-space consisting of all 
vectors of the form 

p(s) = %f4(S) + Xepal(s) + °° * + Xahals). 
X%1, Xg, °° *; X, are the components in the co-ordinate system 
d,, bo, ° * *, On of the vector f(s) in R,. We have 


(, b) = [Bls)b(s)ds = yxy + ata +o + + Fhe: 
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An arbitrary vector % can be broken up into a component ¢ 
which lies in 8, and a component #’ perpendicular to R,: 


p=o+y, 
83) = Be bls) mi = 0. 


It follows from these equations that [cf. (4.3)] 


x. = | b,(s)p(s)ds 


These integrals are called the Fourter coefficients of the function 
uw with respect to the orthogonal system ¢, The orthogonal 
projection ¢ on §,, cannot be longer (i.e. have greater absolute 
magnitude) than q@ itself; this is the content of the so-called 
Bessel inequality 


MX, + MX, + °°: + + HX, S  P(s)p(s)ds. (7.1) 


In fact, since (d, o’) = 0, (f’, 6) = 0, the ‘‘ Pythagorean theorem” 
(pb, ) = (6, ) +, #) 
holds. 


The simplest unitary-orthogonal system in the domain of 
periodic functions, with which the theory of Fourier series is 
concerned, consists of the functions 


== e(ns) (2 = 0, -b 1, 4:2,°° 3°53 efx) = et]. (7,2) 
ME Qar 
This infinite system has the property of completeness; it 
is a complete co-ordinate system for the entire function space. 
The theorem that any periodic function (s) can be expressed 
as a linear combination of the functions (7.2) : 


B(s) = = > X,°e(ns), X%,= ss | é(ns)\b(s\ds 


(Fourier expansion of y(s)) is true only if certain conditions 
concerning the differentiability of #(s) are fulfilled, but any 
continuous function satisfies Parseval's equation 


Qn a 
[ PlsW(s\ds = TS ¥axm (7.3) 
0 m= —-0O 
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We learn from this example that there 1s no essential distinction 
between spaces of a denumerable and of a non-denumerable infinitude 
of dimensions ; we have introduced into our function space 
a complete normal co-ordinate system (7.2) consisting of a 
denumerably infinite set of fundamental vectors. In an n- 
dimensional unitary space a system of unitary-orthogonal 
vectors is complete if their number is 2, but not if it is less; 
however, such an enumeration gives no criterion for oo-dimen- 
sional space. If we leave out a finite number of the functions 
(7.2) we still have an infinite set left, but the completeness of the 
system is destroyed thereby. The real criterion for complete- 
ness lies in the validity of the completeness relation (7.3). 

We can understand the relations existing in Hilbert space 
by analogy with or as limiting cases of those existing in spaces 
of a finite number of dimensions. If we consider the values of 
an arbitrary periodic function w(s) only at the points 


Qa 2a Qa 
ge, eee au eee 
a e pe 
and set 
Qarv 
vi 1 Nee 


we are dealing with an n-dimensional vector space in which the 
components of the arbitrary vector % are these quantities 


& (v=0,1,-°+-+,2—1). Let e, be the vector in this space 
with components 
1 2anr 
=e ( g “) ps ee gee 
Vn 
these vectors e, (A= 0, 1,+ + +, m — 1) constitute a normal co- 


ordinate system for the space, relative to which the vector & 
has the components %, %1, * * *, Xn, Which are to be calculated 


from 
n—1 
a 1 Se (2A, 
v Jn n Ae 
A=0 
In accordance with (4.3) 
n-—1 
pas Se (a 
A /n n v 
y= 0 


whence 


n-l n-1 
DEE, = SS tH 
v=Q@ A=(Q 
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By passing to the limit 2 — oo we obtain the equation of Parseval. 
We do not concern ourselves here with the further considerations 
which may be necessary to establish a rigorous proof, but content 
ourselves with such reasoning by analogy. 

We consider the linear correspondence or ‘ operator ” 


LD) .o which transforms a function p(s) in the domain of 


dip 


periodic functions into ae e(ns) 1s the characteristic vector 


(characteristic function) of this operator belonging to the 
characteristic number rn: 


1 de(ns) 
1 ds 
This operator is Hermitian; the scalar product of @ and Dy 


is the conjugate complex of that of J and Dd, where ¢ and 
are any two periodic functions, for by partial integration 


== n+ e(ns). 


1 ay 1 dd 
| Bs) - 3 Gods = — [we 5 Fa 
0 0 
and the right-hand side 1s conjugate to 
2x $ 
ld 
0 
In fact, the Hermitian form 
2x 
Le 7 dp 
; |b Fas 
0 
assumes the normal form 
+00 
Oe NEnX y (7.4) 
n= — 00 


in the normal co-ordinate system whose fundamental vectors 
are the characteristic vectors of the operator D. The reiterated 


2 
operator DD = — _ appears in the theory of the vibrating 
string, together with the corresponding Hermitian form 
2n 2x 
dy, __ (db dp 
-|¥ ds ds == \5 qs 
0 0 


which represents the kinetic energy of the string. 
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We have here been dealing with a discrete spectrum of char- 
acteristic numbers. But in an oo-dimensional space Hermitian 
forms with a continuous spectrum can also be constructed. 
Consider, for example, the function space consisting of all con- 
tinuous functions y(s) defined in the interval -wSsS4a; 
the square of the absolute magnitude of the ‘‘ vector ”’ & is then 


(f, p) = J P(s)pP(s)ds. 
The Hermitian form 
+2 
Alp] = J sih(s)b(s) ds (7.5) 


is already in normal form, which shows that it has as character- 
istic numbers all numbers between — 7 and + 7. The functions 
(7.2) again constitute a complete normal co-ordinate system in 
terms of which 

+0 


n=— 0 
Substituting this in (7.5) we find 
ee 


yields 0 when n = m and by partial integration 
as +7 = n—m 
E e alin — NST) = pS 
in—m) J_- i(n — m) 


when » =m. The Hermitian form 


1 (— L)n—m | ; 

7>, n--mo™" 

nm 
has therefore as characteristic numbers all values between 
— wand + 7. 

The characteristic vector %, belonging to the characteristic 

value «(—7 Sa S+ 72) of Af] is that function which vanishes 
at all points s += « and is there so large that the integral of 
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Data has the value 1. Of course such a function does not really 
exist, but we can approximate it as closely as we wish. In 
order to arrive at a formulation which is mathematically rigorous 
for the case of continuous spectra, we must introduce in place 
of the idempotent Hermitian form E&, in (5.4) the idempotent 
form AE = YE, for the entire interval A = A8 (2 SA< 8). 


asi<Bp 
For any given vector x 


AE(t) 20,  A*E(t) + AXE) = AE() (7.6) 


and the idempotent forms AE associated with two separated 
intervals A are mutually independent. 

In dealing with the continuum, the sum in (5.4) is replaced 
by a Stieltjes integral. Consider the straight line described by 
the real variable A as being covered with a substance, and let 
the amount of this substance on the interval A be denoted by 
Am. We then have, in analogy to (7.6), 


Am = 0, Am + Atm = Arm. 


If J(A) is a continuous function of position we can construct 


the integral 
1 


| d(A)dam. (7.7) 
0 
An ape een to this integral can be found by dividing the 
entire interval 0 <A S1 into small intervals A;, choosing a 
point A, in A, and ev aluating the sum 3’d(A,) ° Aym. This sum 


then converges to the integral on allowing the A, to approach 
zero. If the distribution has a continuous density 
Am 


lin —~— ==: p(A 
en ae a 


i 
the integral is identical with J #(A)p(A)dd. But the Stieltjes 


0 
integral (7.7) also includes the cases in which there exists no 
finite continuous density ; in particular, it allows the existence 
of discrete points at which a finite amount of the substance 1s 
concentrated. If the substance 1s distributed over a finite 
number of points A == a, in amounts m,, the Stieltjes integral 
reduces to the sum D’d(a,)1;. 


t 
We thus arrive at the following more inclusive formulation 
of the fundamental theorem concerning the transformation to 
principal axes: (1) The Hermitian form A associates with each 
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interval A an idempotent form AE (rt); (2) when two adjacent 
intervals A,, A, are added together to form an interval A, 

AE = A,E+ A,B, 
and the idempotent forms associated with separated intervals are 
independent ; (3) we have 


+ 0 + 0 
v= fdE(x), Alt) = Jr- dE). 


In this form the theorem is adapted to the appearance of con- 
tinuous spectra of characteristic numbers, and is particularly 
appropriate for the purposes of quantum mechanics (cf. IT, § 7). 
The discrete characteristic numbers lie at those points where 
the monotonic increasing function A* F(z) = E(A; r) of A has 
a discontinuity. In our example (7.5) 


B 
APE(] = [H(s)p(s)ds ; 


here % must be taken as 0 outside the interval (— 7, + 7). 
The evaluation in terms of the co-ordinates x, 1s readily accom- 
plished. 

Consider the function space consisting of the totality of 
all functions %(s) of a variable s, which assumes all values from 
— oo to + o, and which have a finite absolute magnitude 


(pb, b) = J P(s)p(s)ds, 


i.e. which are ‘“‘ integrable square.’’ The characteristic functions 


; ld 
associated with the linear correspondence f(s) > ; = are again 
the functions e(vs), but the frequency v can now assume all real 
values. The components of #(s) are the quantities 

-+ 00 


1 
ioe sa \ Hse vs)ds. 


16 @) 


Fourier’s integral theorem then allows us to conclude the validity 
of the expansion 
+ 00 
1 
S) = —=—]elvs d 
Hs) = ex elvs)f(r)do 
—- © 
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under certain assumptions concerning the differentiability of 
the function ¢(s); but in any case the completeness relation ? 


is valid. We arrive at a somewhat different problem when we 


only require that the functions ¢(s) be such that (s)p(s) 
possess a definite mean value 


+a 


lim 5-[Bblods = (, ¥): 


this leads to the theory of almost-periodic functions developed by 
H, Bohr.?, Here again the validity of the completeness relation 
can be established. 

The theory of the characteristic numbers of Hermitian forms 
in infinitely many variables has been developed by Hilbert and 
Flellinger,® but it is applicable only to bounded forms 


A(t) a 2,4 int X 


t, 


i.e. forms whose values have a fixed upper bound when 
x oe PTET = I. (7.8) 
i 


Indeed, without this assumption, we cannot guarantee the 
convergence of A(x) in the entire domain (7.8); as an example 
consider the form (7.4), )’int,x,. That this form only converges 


n 
in a portion of the domain (7.8) is merely another expression of 
the fact that not every continuous function is differentiable. 
The situation is more favourable for unitary forms as they 
satisfy the condition that they be ‘ bounded” in consequence 
of their very definition; a unitary transformation is thereby 
to be taken as satisfying both of the conditions 


UU=1, UU=1. 


The theorem on principal axes has been proved rigorously for 
bounded Hermitian and for unitary correspondences in o- 
dimensional space. A method due to A. Wuntner + seems 
particularly appropriate for dealing with unitary correspond- 
ences; it 1s based on the consideration of the discrete group of 
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all powers U* of the given unitary transformation U, and deter- 
mines the monotonic increasing function E(A;z) of the real 
variable A (0 SA S 27) by means of the equations 


Ur(t) = fed, E(A; 2) (7.9) 


0 


(the problem of trigonometric moments). ¥. v. Neumann ® has 
gone furthest in dealing with linear operators for which bounded- 
ness is not postulated. In accordance with § 6 with a Hermitian 
form A is associated a group of unitary correspondences e'"4 = U(r) 
depending on the real parameter 7 and satisfying the equation 


U(r + 7’) =: U(r) U(z’) ; (7.10) 


the study of this group is equivalent to the study of A. It is 
therefore perhaps appropriate to replace this latter for o- 
dimensional space by the former, for no convergence difficulties 
appear in the domain of unitary transformations. We must 
therefore attempt to bring the operators U(r), which are con- 
tinuous functions of the real parameter 7 satisfying (7.10) 
simultaneously into the form 
22 
U(r; t) = fed, E(A; 0). (7.11) 


0 


This is accomplished with the aid of Wintner’s method on re- 
placing the discrete parameter m in (7.9) by the continuous 
parameter 7. The problem (7.11) bears the same relation to 
(7.9) as Fourier’s integral bears to Fourier series. 

In setting up a system of axioms for co-dimensional vector 
space the axioms («), (8B) of § 1 and the metric axiom (8) of § 4 
can be retained; for the proper substitute for the dimension 
axiom (y) see, e.g., v. Neumann, ‘‘ Mathematische Begriindung 
der Quantenmechanik.”’ ° 

The algebraic and geometric tools developed in this chapter 
offer a natural medium for the expression of quantum mechanics ; 
they already hold a dominating position in the classical physics 
of continuous media. A masterly exposition of their mathe- 
matical content and application is found in the first part of 
Courant-Hilbert’s ‘‘ Methoden der mathematischen Physik,” 
2nd ed. (Berlin, 1930). 


CHAPTER II 
QUANTUM THEORY 


§ 1. Physical Foundations ! 


HE magic formula 
iE = hb (1.1) 


from which the whole of quantum theory is developed, establishes 
a universal relationship between the frequency v of an oscillatory 
process and the energy & associated with such a process. The 
quantum of action / is one of the universal constants of nature 


h = 6-547 < 107?’ erg secs. 


It was first discovered by Planck at the turn of the century in 
the laws of black body radiation ; that is, radiation which 1s 
enclosed in a cavity and is in thermodynamic equilibrium with 
matter of a definite temperature, which by emission and ab- 
sorption causes an exchange of energy between the various 
frequencies contained in the radiation. Since this equilibrium 
is independent of the particular nature of the matter involved, 
Planck considered, as a kind of schematic matter, a system of 
linear oscillators of all possible frequencies. A charge oscillating 
with frequency v interacts with the electromagnetic field by emitt- 
ing and absorbing radiation of the same frequency. Planck as- 
sumed that the exchange of energy took place in integral multiples 
of an energy quantum ¢€; he at first considered this assumption 
merely as a mathematical device, and intended to pass to the 
limit ¢€ = 0. In order to obtain agreement with the Wien 
displacement law, which was derived from general thermo- 
dynamical principles, the energy quantum associated with a 
definite frequency v must be taken proportional to v: € = hy. 
In this way Planck obtained his radiation formula, which is in 
excellent accord with observation ; according to it the amount 
of energy contained per unit volume in the spectral interval 
v,v + dv in thermodynamic equilibrium at temperature @ 1s 


3 
ie (1.2) 


= ns Cau 1) 
41 
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where c is the velocity of light and k the Boltzmann constant 
(3k8@ being the mean energy of an atom of a monatomic gas at 
temperature 8). On passing to the limit h = 0 we obtain the 
Rayleigh-Feans radiation law 

Sav? 


u(y) = oe ké. 


The assumption of the validity of this latter law for the entire 
spectrum is in gross disagreement with the facts, as it would 


lead to an infinite value for the total energy {u(v)dr ; a state of 


equilibrium would therefore be impossible with given finite 
energy. 

The idea of a quantized exchange of energy, which occurs 
in Planck’s derivation somewhat schematically and only in 
application to statistical thermodynamical consequences, was 
first seriously applied to individual atomic processes by Einstein. 
In 1905, guided by the observations of H. Hertz, Hallwachs 
and Lenard on the photo-electric effect, he enunciated the idea 
of a light quantum or photon as “an heuristic viewpoint con- 
cerning the generation and transformation of light ’’ 2 according 
to which not only the exchange of energy between matter and 
radiation of frequency v occurs in quanta of amount hy, but 
further, light of frequency » can exist in the ether only in quanta 
of energy hy. The decisive experiments were first performed 
by Millikan ten years Jater. By allowing ultra-violet or X- 
radiation of frequency vy to fall on a metal plate electrons are 
released whose kinetic energy (as was already known to Lenard) 
increases with the hardness (i.e. with decrease of wave-length) 
of the incident radiation; the energy with which the electrons 
are emitted is, however, not influenced by the intensity of the 
radiation. The exact relation predicted by Einstein is 

2 

hyip Ss eV 

2 
where — e, m and v are the charge, mass and velocity of the 
electron, respectively. The energy hy of the photon is trans- 
formed into kinetic energy of the electron, after subtracting 
from it the work P required to pull the electron out of the metal 
surface. If the potential difference between the metal surface 
and a plate placed in front of it is V’ the electron current will 
disappear as soon as V’’ exceeds the critical value V,) = lad 
Millikan found that the potential at which the current vanished, 
obtained by extrapolation, was in fact exactly proportional to 
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the frequency v for monochromatic light of various frequencies, 
and that the constant of proportionality was equal to the 
quotient of the k obtained by Planck from black body radiation 
and the elementary quantum of electric charge e, The differ- 
ence of the mean energy P for two different metals is furthermore 
equal to e times their contact difference of potential. The 
value of P, or at least its order of magnitude, is therefore known, 
and we find that for X- rays of a few Angstréms wave-length 
(1A == 10-® cm.) P is negligible in comparison with hv. The 
equation 
2 
hy = = eV (1.3) 
2 

governs not only the generation of secondary cathode rays by 
primary X-rays, but also the inverse process: the transformation 
at the glass wall or on the anode of the incident cathode rays 
into the impulse radiation first observed by Rontgen. If an 
electron which has run through the potential drop — V in the 
X-ray tube loses its entire energy on collision, a photon of fre- 
quency v and energy hv = eV will spring into existence. The 
electron may, however, only be slowed down; consequently 
vis only the upper limit for the frequency of the impulse radia- 
tion, which will therefore consist of a continuous spectrum with 
a sharp limit at y= - The old classical theory of radiation 
was entirely unable to account for this most characteristic 
property of the impulse radiation. The frequency of the limit 
increases in proportion with the applied potential—and this 1s 
the exact formulation of the fact that “ the higher the potential, 
the harder the rays’’ so familiar to every X-ray operator. 

The observed phenomena thus confirm the hypothesis that 
radiation of frequency v can be absorbed and emitted only in 
quanta of energy hy. This hypothesis will of course have further 
consequences for the theory of the structure of matter. The 
Planck oscillator will, for example, be unable to alter its energy 
continuously since it can only emit or absorb these fixed quanta 
of energy, and it will consequently spring to and fro on the rungs 
of its energy ladder, which are equally spaced at intervals hy ; 
v 1s here the frequency of the oscillator, a constant determined by 
the constitution of the oscillator. An application of the essential 
elements of this idea to actual atoms gave rise to the frequency 
rule enunciated by Niels Bohr (1913) : : 

An atom can exist only in certain discrete stationary states 
(“‘ quantum states"’) in which it does not radiate. Light will be 
emitted on transition from one State into another ; the energy which 
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it loses in this transition, the difference E, — E, of its energy in 
the two states, will be transformed into a photon of energy hv, the 
frequency v of which 1s determined by the equation 


hv = is =< Fy. (1.4) 


In this equation E,, E, may be any two of the discrete energy levels 
(E,>E,). Conversely, in absorption a photon raises the atom from 
the energy level EK, to a higher E, by giving up its energy hv to the 
atom. 

According to classical electrodynamics an atom should 
continually emit radiation in consequence of the vibrations of 
its constituent electrons, and the frequencies of the emitted 
light should agree with the frequencies of the simple oscillations 
into which the motion of its electronic system can be resolved. 
But the atom will itself lose energy through this radiation, the 
motion of its electrons will thereby be modified and the fre- 
quencies will consequently be displaced. This entire point of 
view 1s therefore irreconcilable with one of the most fundamental 
physical facts: the existence of sharp spectral lines. On the 
other hand, Bohr’s assumption is not only in agreement with 
this fact, although it offers no such detailed picture of the 
reaction between matter and ether as the classical theory, but 
contains in addition the fundamental Ritz-Rydberg combination 
principle. If we order the energy levels in an increasing series 
Fo < FE, < F,<:+-+ +, then in accordance with (1.4) each 
frequency v is the difference of two “ terms” v, = E,/h, 


v(i—> k) =v, — vy (1 > k). 


Consequenily there will occur in addition to the frequencies v(1 —> k), 
v(k -> 1) the frequency 


v(t —> 1) = v(t > k) + v(k > 2D) (1.5) 


obtained from them by addition. This combination principle 1s 
valid without exception in the whole of spectroscopy, in the 
optical region as well as in that of X-rays, and has proved to 
be a valuable guide in the classification of spectra; it reduces 
the complex line spectra to the simpler term spectra. Un- 
fortunately the problem is made more difficult by the fact that 
not all lines corresponding to possible transitions 1—>k need 
actually occur—not every term y; need ‘‘combine”’ with a 
given term v,—for the conditions of excitation may be such 
that certain lines have zero intensity. The selection rules for 
the allowable transitions will therefore be contained in the 
rules which determine the intensities of spectral lines. The 
combination principle, or the Bohr frequency rule, determines, 
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so to speak, only the keyboard of the spectrum—which tones 
are really struck 1s dependent on the mode of excitation. But 
it will in general be possible under proper conditions of ex- 
citation, e.g. the influence of strong external electric fields, to 
bring out the lines which are not observed under ordinary 
conditions. 

In the “ unexcited’ or normal state the atom 1s 1n the stationary 
state of lowest energy Ey, and consequently only the lines of the 
‘series ’’ 1 — 0, of frequency v, — vp (2 = 1, 2, °° +), occur in 
absorption. The lowest of these 1 > 0 (i.e. with greatest wave- 
length), or more precisely the lowest which is not forbidden by 
the selection rules, is called the ‘‘ resonance line.” 

The simplest atom is that of hydrogen ; in it a single electron 
of charge — e revolves about a nucleus of opposite charge + e. 
The terms of the spectrum of atomic hydrogen are found by 
observation to be given by the equation 

Vy, R 

em (1.6) 
where R = 109700 cm.~! is the Rydberg constant (spectroscopists 
are accustomed to give the wave number v/c, the reciprocal wave- 
length, instead of the frequency v). The energy levels corre- 


Rhe 


sponding to these frequency terms are £, = — or To this 


discrete term spectrum we must add the continuous spectrum 
E20; the additive constant in the energy is so chosen that 
E = 0 separates the hyperbolic electron orbits from the elliptic. 
The Balmer series consists of the lines » — 2 with wave numbers 


RS) (123 3,-4,.5 9 3), 


This is the oldest known series formula; Balmer obtained it in 
1885 by abstraction from the first four lines of the series, called 
Hz, He, Hy, Hs, which lie in the visible region. The lines of 
this series converge with increasing » to a limit with wave 


number = (wave-length a= 3650A J. gall is the work required 


4 

to ionize an H-atom in the stationary state m = 2, 1.e. the work 
required to remove the electron from such an atom without 
leaving it with kinetic energy. The continuous spectrum, 
arising from transitions which ionize the atom, will join on to 
this series limit on the short wave side. We are further ac- 
quainted with the Lyman series n + 1 which lies in the ultra- 
violet and also occurs in absorption, the Paschen series n — 3 


46 QUANTUM THEORY 


lying in the infra-red, and finally with some members of the 
Brackett (n> 4) and Pfund (n -> 5) series in the far infra-red. 
In order to ionize hydrogen in the normal state an amount cRh 
of work must be done; the corresponding “‘ ionization potential,”’ 
i.e. the potential difference an electron must traverse before it 
is able to ionize atomic hydrogen by means of its kinetic energy, 1s 


Vs = =: 13-53 volts. 


Bohr’s frequency rule goes beyond the combination principle 
in asserting that the terms are actually energy levels, an assertion 
irrelevant to and not verifiable by spectroscopy. That this ts, 
however, in fact the case is confirmed by the experiments of 
Franck and Hertz on collision phenomena.* In these experiments 
electrons are given an amount eV of kinetic energy by allowing 
them to pass through an electric field of known potential differ- 
ence — V and are then allowed to pass through a gas consisting 
of the atoms which are to be investigated with the velocity thus 
obtained, without further influence from external fields. The 
electron can give up no energy to the atom until eV is greater 
than the excitation energy &, — &, of the resonance line; if 


oe ig ~<eV <= Lh, =— fy 


é 


then the electron can either suffer an ‘elastic collision,” in 
which case it loses no energy, or it can suffer an “ inelastic 
collision,’’ in which case it loses an amount ££, — if, to the 
atom. The electrons which have passed through the gas are 
of two kinds, those with kinetic energy eV and those with 
eV — (E, — Ey). When the atoms which have been raised 
from the state 0 to the state 1 by collision with electrons fall 
back into the normal state they emit the resonance line and, 
under the above conditions, only this line. This is fully con- 
firmed by the experiment. The kinetic energy of the emerging 
electrons is measured by introducing a retarding potential V’ : 
the electrons only come through it if their energy is greater 
than eV’. In general the electrons possess a discrete ‘‘ energy 
spectrum ”’ after collision with an atom of the gas; the possible 
energy values are 


eV, =eV — (EL, — E,) 


(n = 0, 1, 2,-- +, in so far as V,’ is still positive; we here dis- 
regard the possibility that a single electron may suffer more than 
one inelastic collision). On allowing the retarding potential V’ 
to decrease gradually from a value which is greater than V the 
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electron current decreases suddenly whenever V’’ passes through 
one of the values Vj, V,,° °° 


Bohr’s frequency rule reduces the determination of spectra 
to the problem of obtaining the stationary states and the correspond- 
ing energy levels of an atom, i.c. of a mechanical system of known 
dynamical constitution. The example of the linear oscillator 
given above and the fundamental notions of the theory of 
oscillations suggest the following as a general guiding principle 
(P): the frequencies derived from the energy levels by means 
of Bohr’s frequency rule shall correspond to the frequencies of 
the simple vibrations into which the actual motion of the atomic 
constituents can be resolved in accordance with the laws of 
dynamics. Such a resolution into simple oscillations is con- 
vincingly attainable in classical mechanics only if the system 
is ‘‘ multiply ” or ‘‘ conditionally periodic,” and for this case it 
was actually found possible to sharpen the general principle (P) 
into a definite rule for quantization. In the years 1913-25 the 
application of this quantum rule yielded a great harvest of 
results, and it seemed that we were in possession of the key that 
would unlock the mysteries of atomic processes. But the wards 
did not quite fit; toward the end of this epoch its failure became 
more and more apparent and the physical theory was gradually 
reduced to a symbolic calculus of quantum numbers which had 
to be corrected cach time a new fact was discovered. We do 
not wonder now that it ran such a course, but rather are surprised 
that it was as successful as it was | 

From the beginning the quantum rules were a compromise. 
If a mechanical system of one degree of freedom undergoes a 
periodic motion the frequencies v of the simple vibrations into 
which its motion can be resolved are integral multiples of a 
fundamental frequency w. This frequency depends on the 
energy of the orbit under consideration, and this latter is re- 
stricted by the quantum rules to the discrete set &,. The 
internal frequencies of the motion are therefore given by the 
formula 


v= k*w(n) (1.7) 


which depends on the two integers 7 and k. By the analogy 
with quantum mechanical frequencies this internal frequency 
(1.7) is to be ascribed to the jump »—> (wm — k). The fact that 
v depends linearly and homogeneously on the jump & is expressed 
by the ‘‘ classical combination principle "’ 


v(r—>n—k)+rn>on—lh=rvn>n—k—I) (1.8) 
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in consequence of which frequencies with the same initial state 
n will combine. But this is not in accord with the correct 
combination principle 


vn>n—k)+vn—k>n—k —1l) =v(n>n—k — I) (1.9) 


The changes k, / in the quantum number are here the same as 
in (1.8), but the final state n — k of the first frequency coincides 
with the tnztial state of the second; only for quantum numbers 
n which are large compared with & and / does the classical 
principle agree asymptotically with the Ritz-Rydberg com- 
bination principle. Consequently if the general principle (P) 
is to be satisfied without compromise our mechanics must be 
altered in such a way that the false combination principle (1.8) 
is replaced by the correct one (1.9). In 1925 Heisenberg dis- 
covered a way in which such an alteration can be naturally 
accomplished ; in order to do this, however, it was necessary 
to give up the picture of an atom with its electronic orbits. 
The quantities with which the Heisenberg theory deals are 
only the frequencies and intensities of radiation associated with 
transitions between the various states of the atom. 

It should be observed that the correct combination principle 
(1.9) is in one important respect simpler than the false one (1.8). 
As the formulation 


y(n" > n') + v(n’ > n) = v(n"’—> n) (1.10) 


shows, the quantum numbers serve only as distinguishing marks 
or indices which do not involve a law of composition, whereas 
the classical formula requires the addition of quantum numbers, 
which are therefore numbers on a definite scale. 

Another approach to quantum mechanics was discovered 
by L. de Broglie and E. Schrodinger. This approach seems to 
me less cogent, but it leads more quickly to the fundamental 
principles of quantum mechanics and to the most important 
consequences for experimental science. We shall therefore 
follow it, since we are more concerned in giving a short but 
comprehensive account than in giving a complete discussion of 
the physical foundations. The physical, essentially statistical, 
interpretation of the theory, with which Schrodinger has not 
been entirely in accord, is due mainly to M. Born. 


§ 2. The de Broglie Waves of a Particle 


We consider the undulatory character of light as guaranteed 
by the phenomena of diffraction and interference. Their most 
decisive feature is that with them we are dealing with the linear 
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position of waves with arbitrary differences of phase. From 
1athematical standpoint, they are characterized by the fact 
they involve addition and multiplication with complex 
ers, and we are consequently dealing with vectors in a 
lex space. We can, in fact, consider a complex function 
“yz) employed in the description of the phenomena and 
>d over time and space as such a vector, where each space- 
point represents one dimension of a complex vector space ; 
ifferential laws for such a wave function %—or for several 
functions simultaneously, such as the components of the 
ic and magnetic field strengths—are linear and homo- 
us. But on the other hand the quantum phenomena 
1 we discussed above speak just as plainly in favour of 
orpuscular nature of light. The intensity of the mono- 
natic radiation employed in the production of the photo- 
ic effect has no influence on the velocity with which the 
‘ons leave the metal; it influences only the frequency of 
svent. Even with intensities so weak that on the classical 
y hours would be required before the electromagnetic 
‘y passing through a given atom would attain to an amount 
. to that of a photon, the effect begins immediately, the 
s at which it occurs being distributed irregularly over the 
2 metal plate. This constitutes a proof of the existence of 
yns which is no less direct than the proof that «-particles are 
rpuscular nature by observing the scintillations caused by 
on striking a sensitized screen. Further, if one considers 
>xchange of momentum in addition to that of energy in 
ing the laws of black body radiation, conflict with Planck’s 
thesis concerning energy quanta can be avoided only by 
ning that in addition to the emission of the energy quantum 
quantum hv/c of momentum is emitted in a definite direction, 
ucing an equivalent reaction on the atom.* We here replace 
ontinuous radiation of a spherical wave by the discontinuous 
sion of photons in definite directions which are irregularly 
ibuted over the compass. 

Ve unite the two standpoints by retaining the linear wave 
‘ton, but considering the intensity pb as the relative probability 
the photon appears at the point (x, y, 2) at time t; or, more 
sely, that 


b wb dxdydz (2.1) 


e probability that at time ¢ it will be found within the small 
llelepiped with sides of length dx, dy, dz about the point 
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(x, v, 2).* But we can only expect to arrive at a rational theory 
if we deal with material particles in the same way as with photons. 
This point of view was developed in the Bose-Einstein treatment 
of an atomic gas, which paralleled that employed in the theory 
of black body radiation (“light quant gas’’).6 Schrodinger’s 
researches took as their point of departure the Hamiltonian 
theory of mechanics, which was originally obtained by Hamilton 
himself from an analogy with geometrical optics. He argued 
that since we replace geometrical optics, with the aid of which 
interference and diffraction cannot be treated, by wave optics, 
it is reasonable to attempt the analogous transition in mechanics. 
The results amply justified the attempt. The investigations of 
Davisson and Germer, which prove the existence of interference 
in beams of electrons reflected from a crystal lattice, were already 
in progress when de Broglie published his theory. The experi- 
mental evidence that moving material particles behave in much 
the same way as a beam of light with respect to these phenomena 
is now fully established, and with no less certainty than for 
X-rays, by a series of further investigations by the same 
authors and by G. P. Thomson, F. Rupp and others.?. The 
real difference between “ light-like ’’ and “ electron-like’’ beams 
lies in the fact that the particles composing the latter possess 
charge and proper mass and can consequently be deflected by 
electric and magnetic fields. 

A simple oscillation is one in which the function y, defining 
the state of the system, depends on the time in accordance with 


the law 
W(t) = are (2.2) 


where a and v are independent of ¢t. [We choose as our unit 
of angular measure that one which proves most useful in differ- 
ential calculus, for it yields the simple relation 


dade (2.3) 


1 ax 
for the fundamental trigonometric function e'* = e(x), The 
sum of the angles about a point is then 27; it would, admittedly, 
be more correct from the integral standpoint to take this as 1, 
but then the factor 27 would appear in the differential relation. 
y/27 is the number of oscillations in unit time; we shall not 


* Just as in the classical wave theory we have an expression for the flow 
ot energy in addition to its density, so in the more refined formulation of 
quantum theory we will have an expression for the probability that the 
photon passes through a given element of surface (‘‘ probability current ’’) in 
addition to one for the probability that it be found in a given element of 
volume (‘‘ probability density ’’). 
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hesitate, however, to use the name ‘frequency’ for vy. Jf we 
denote Planck’s constant of action by 27h instead of h, and we shall 
throughout the present work, the fundamental formula (1.1) 
will still be valid in the new nomenclature.] In accordance with 
(2.3) the simple oscillations (2.2) are the characteristic functions 
of the linear Hermitian operator which carries p over into 


—-+-"; the corresponding characteristic numbers are the 
t 


energies HE =: hy. If the dependence of a state of the system on 
time is described by a superposition of simple oscillations 


b(t) = aye! 4 agent foe ss, (2.4) 
the energy is capable of assuming only one of the values hy, 
hv,, : + +, and we shall take the intensity a,a, = | a, |* of the 


oscillation of frequency v, in W as the relative probability that 
the energy is observed to be hv,. The relation /: = hv 1s accord- 
ingly to be interpreted: if v 1s indeterminate because an entire 
spectrum of frequencies v 1s contained 1n the oscillatory process, then 
the energy is indeterminate to the same extent ; the intensities 
with which the various simple oscillations occur in the process 
measure the probabilities of the corresponding energies. The 


h a 
operator — oe) represents the energy : 


dt 
hd 

H- eT (2.5) 
in the following sense: a characteristic function of (2.5) represents 
a State in which the energy assumes a definite value E with certainty. 
This value is the corresponding characteristic number ; in an 
arbitrary state the components a of ys with respect to these character- 
istic functions determine the relative probabilities aa of these 
values fs. 

According to the theory of relativity energy is to be con- 
sidered as the time component of a 4-vector whose spatial com- 
ponents constitute the linear momentum ) = (p,, py, pz). The 
fundamental metric invariant of the two vectors running from 


rn 


the origin to the points (t, x yz), (t’, x’y’z’) is the scalar product 
c*tt’ — (xx’ + yy’ + 22’). 
Under a Lorentz transformation, which transforms from one 
space-time co-ordinate system to another equally permissible 
one, the quantities 
ct, — x, —y, — 2 

must consequently transform contragrediently to t, xyz; thev 
are therefore the components of the vector associated with 
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(t, x yz) inthe space which is the dual of the 4-dimensional space- 
time world. Such a dual vector is given by 


H, — Pr, — Pu — Pz} 
or, what amounts to the same thing, 
Hdt — (p,dx + pidy a paz) 


is invariant under Lorentz transformations. The same is true 
of the total differential operator 


P) P) P) P) 
d= ~dt + (<-ae + xdy + < dz) 


applied to an arbitrary function of t; x, v, 2 Hence the corre- 
spondence (2.5) necessarily implies the further relations 


OD pe OO 
Px 19x’ Py 1 dy’ P: 1 dz’ 


which are to be given the analogous interpretation. 
A homogeneous plane wave 


(2.6) 


= as ell tart py + x2) (2.7) 


is simultaneously a characteristic function of the four mutually 
commutative operators (2.5), (2.6), which has as characteristic 
numbers 


H=hv: p,=ha, ~p,=hB, p, =hy. (2.8) 


It represents a state in which the energy and linear momentum 
of the quantum possess these sharply defined values. 

In classical mechanics the laws governing the motion of a 
particle are known as soon as we express its energy H in terms 
of the ‘canonical variables" xyz, pep,p, In Newtonian 
mechanics the Hamiltonian function for a free material particle 
of mass m 1s 


yu Pet Py t be 


2m 


(2.9) 


on employing the transition scheme developed above we obtain 
the corresponding wave equation 

hoy FH >? o? 3? 

7 — = Ap=0 (A= isnt <3): (2.10) 
(2.7) is a solution of this equation provided the values (2.8) of 
energy and linear momentum satisfy equation (2.9); in this 
sense (2.9) and (2.10) are equivalent. But the equation (2.10) 
is linear and has as its most general solution a linear super- 
position of simple waves (2.7) ; such a superposition corresponds 
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to a state in which the energy and momentum of the particle 
assume their various permissible values ‘* with a certain definite 
probability.” 

The space vector («, B, y) in (2.7) gives the direction of 
propagation of the plane wave, and the modulus of this vector 
is the wave number p (the number of waves contained in 2 
units of length; 2z/ is the wave length A). Hence by (2.8) 


2h 
the absolute value p of the momentum is equal to hp =": 
” is the phase velocity of the wave; in accordance with (2.9) or 
h 2 
Vv om Lt 


it is hu /2m = h/Am and depends on the wave length or frequency 
(dispersion). Since p= mv, where v is the velocity of the 


ad h — 
particle, the ‘‘ group velocity ”’ = ~ =v coincides with the 
velocity of the particle. eepeaments on diffraction and inter- 
ference phenomena in electron beams, such as those performed 
by Davisson and Germer, have made it possible to test directly 
these relations set up by de Broglie. 

In relativistic mechanics we have in place of (2.9) an equation 
which states that the square of the absolute value of the energy- 
momentum 4-vector is constant and equal to mc? : 

2 
ca (Pa + Py + Pz) = mre? (2.11) 
or 


= ca/m*c? + (pi + py + Pi). 
For the transition’ to a wave equation it 1s of advantage to employ 
the rational form (2.11) of this expression : 


1 ob mc? 
— ¢2 2 + Ais = aa (2.12) 


Here again the group velocity is equal to the velocity uv of the 
particle, but the phase velocity is found to be c?/v; the former 
is always less, the latter always more than the velocity of cae 
In order to return from the relativistic to the “ ordinary ” 

Newtonian mechanics by passing to the limit ¢ — oo, we suet 


first replace H by mc?-+-A, 1.e. & must be replaced by pe yb. 


The differential equation governing light waves can be ob- 
tained from (2.11) by dropping the term on the right-hand side. 
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Hence from the corpuscular standpoint light consists of photons 
or particles of proper mass 0: 
2 


H 
cz — (bi + Py + P2) = 0. 


In accordance with the expression (2.1) for the probability 
density, we are to consider as the vector in unitary system-space 
describing the state of the system the function & in so far as it 
depends on the spatial co-ordinates xyz. The integral of (2.1) 
with respect to the spatial co-ordinates gives the probability 
that the particles will be found ‘ within the volume V at time t.” 
Space and time must be separated from one another; the system 
has at each time t a definite state w(xyz), which will in general 
vary with ¢. The operators which represent physical quantities 
must accordingly be ones which operate on an arbitrary function 
of the spatial co-ordinates. This requirement is satisfied by 
the operators (2.6) corresponding to the momentum co-ordinates, 
but not by differentiation with respect to time, which we have 
associated with the energy. We must instead consider the 
situation as described as follows: from the expression for the 
energy in terms of the canonical variables ~,, p,, p, we obtain 
the operator Ff which represents the energy and which operates 
on the function #(xyz). The equation 


h db 


is then the dynamical law which determines the change in the 
state % in time. 

The separation of space and time offers certain difficulties 
to the development of quantum theory from the relativistic 
standpoint ; consequently, for the present, we base our develop- 
ment on the Newtonian mechanics. 

Our procedure must eventually be modified in another 
important respect : we have here tacitly assumed, for the sake 
of mathematical simplicity but without physical justification, 
that the wave field of a material particle 1s described by a scalar 
quantity %. The modification, which is required in order to 
give an adequate description of the facts of spectroscopy, will 
be made in Chap. IV. 


§ 3. Schrodinger’s Wave Equation. The Harmonic 
Oscillator 


When the particle is moving under the influence of forces 
the kinematic part (2.9) of the energy is augmented by the 
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potential energy, which usually depends on the co-ordinates 
alone and not on the momenta. We must therefore know 
which Hermitian operator acting on #% corresponds to the co- 
ordinate x. I assert that it is multiplication by x; this operator 
is already referred to its principal axes, its characteristic values 
are all real numbers x and finally s(x), or more precisely p(x) dx, 
is the component of the “ vector ’’ associated with the character- 
istic number x (we have here ignored the other co-ordinates y, 2). 
In accordance with the statistical interpretation of the relation- 
ship between physical quantities and operators, our assertion 1s: 


the probability that x has a value between x, and %, 1s | Pybde ; 


this is in agreement with the expression (2.1) for the probability 
density. If V(xyz) is a function of position in the 3-dimensional 
space, e.g. the potential energy, then the physcial quantity V 
is represented by the operator 


yb —> V(xyz) >, 
for the probabunity that V les between Y, and V4 1s given by the 


integral 
\J | Pubdxdyds 


extended over that portion of space in which Vy S V(xyz) S V4. 

The operators corresponding to x, y, 2 commute with each 
other, but the operator Q corresponding to x and the operator 
P corresponding to p, do not. In fact 


FE fai(a)] — x2) = la) 
or PO = OF = Q 1 


t 
where the 1 on the right-hand side stands for the operator 
identity: w(x) —> w(x). Because of this non-commutative re- 
lation between the operators P and Q, p, cannot assume a definite 
value with certainty when x does, and conversely. In fact, if p, 
is known to have the value he with certainty, then the dependence 
of % on x is given by the factor e'**; in consequence of this the 
position x of the particle 1s entirely indeterminate, since the 
probability dw of localization is the same for all points x. 

If V(x, y, 2) is the potential energy of the field in which the 
particle moves, the total energy 1s 


(4) 
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We assume with Schrédinger that in spite of the fact that all 
our variables do not commute we may still apply our rules for the 
formulation of the wave equation; we thus obtain Schrddinger’s 
differential equation 


ho wh _ 

FET dpb + Vive) b= 0. 
We understand by ‘‘ stationary’ or ‘‘ quantum states’ & those 
in which the energy £ has a definite value; they are character- 
ized as solutions of the wave equation which satisfy in addition 
the equation [cf. (2.5)] 


a ee ‘ 
1 ol p 


On setting E = hy, such a ® will have the form e~*”- % where 
the new function denoted by # 1s independent of ¢. This function 
w(xyz), which depends only on the spatial co-ordinates, satisfies 
the reduced equation 


SA + [E — V (xyz) = 0. 


The problem 1s thus reduced to finding values of & and functions 
w + 0 of position which satisfy this equation and are such that 
the integral of dus over the entire space is finite. They are the 
characteristic numbers and characteristic vectors of the Hermitian 
operator H associated with the energy (3.1) in the function space 
of all functions of position w. The characteristic numbers E 
are the possible energy levels of the particles. 

Before going any further into the interprctation of the theory 
we have developed, it will be well to convince ourselves that it 
leads to energy levels which are in agreement with the facts. 
The simplest example is that of the linear oscillator ; with it 
we are dealing with only one co-ordinate x. The potential 


energy is V(x) = 5 and the total energy 


H = 5(& -+ ax’). (3.2) 


The equation for the determination of the characteristic values 
& and the associated characteristic functions ¢ is 


Laie a A (E = se (x) = 0. (3.3) 


2m dx? 2 
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Hermitian polynomials. The solutions of this equation are 
expressed in terms of Hermitian polynomials. The 2 Her- 
mitian polynomial 7,(x) is defined by the equation 


a” 


gute 8") = (— 1)Pe8 + a(x) | (3.4) 
it is of n™ degree and the highest term is exactly x”. The 
nr(x) (n = 0, 1, 2,* + +) constitute an orthogonal sect of functions 
with the ‘‘ density function ” e~*/*: 

+ 00 

fe? nalx)nm(x)dx = 0, m = 1; (3.5) 

— 0 


the functions 


Pr(x) = ETP + a(x) 


are consequently orthogonal in the ordinary sense. To prove 
this we need merely to note that 


+ 0 
(— 1)"{ ER (e#?) - ala) dx 
—- © 
becomes, on integrating » times by parts, 
+ 00 
jew ° a” m(%) 4, 
dx” 
—~ 00 


and the integrand vanishes for m <n. For m = n we obtain 


-+ CO 
n! \ eB 


— wo 
so the equations (3.5) can be supplemented by 
+ 00 


—— on 2(x)dx = 1! V/ Orr. 


From (3.4) we have 


~ 242 n ant} = 24/2 
ee  * Maaa(x) = — (— Yale?) 


a as either i) or zi o:). Since 


and we can consider 
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and 


n-1 


d" a” d 
—____.(yp~x3/2\ —. x?/2 
Tan %e =a Tr ree (iad ae na 


men a 
the first of these interpretations yields the recursion formula 


Nn41(%) a x1 n(x) ~~ AN n—1(%). (3.6) 
From the second we find 


d 2 it 
Wyle 2 na(x)] = EPP + ny ya(X) 
or 


ann 
Ansi(x%) = — a + %7(%). (3.7) 


On subtracting the recursion formula (3.7) from (3.6) we find 
the simple relation 


de a (3.8) 
Differentiating (3.7) and substituting (x + 1)y, for the derivative 
of ma+, in accordance with (3.8), we obtain the differential equation 


an, yfte _ 
dx® at My Se 


The equation for ¢,(€) is comes 
ad, I _ 
dée ba + +(n- t 5) bn = 0. (3.9) 


On going over to a new unit of length by the substitution 
x = af, the left-hand side of (3.3) is equal to the left-hand side 
of (3.9) multiplied by h?/2ma? provided 


h? 1 an? h? 
Ima 4. 2° naa” a 3) = es 


Let w = Va/m denote the classical frequency of the oscillator. 
The first of these conditions determines the new unit of length a: 


: h h 
Ce == —_- ss —————- 
2+/am 2mw’ 


and the second requires that 
| ne ne (3.10) 
It is possible to show that the ¢,(&) constitute a complete ortho- 


gonal system,® and consequently there can exist no further 
characteristic numbers and functions. The oscillator possesses 
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the discrete energy levels (3.10) at intervals hw apart. That the 
lowest energy level turns out to be 4hw instead of 0 ts of itself of 
no significance, as we may always introduce an additive constant 
into the energy, although it is meaningful to assert that the least 
possible value of the quantity H, (3.2), is equal to dhw. 
However, the wave equation not only yields the energy levels 
as characteristic values, but it also gives us information con- 
cerning the probability of localization by means of the character- 


eee 


as the 


istic functions. For convenience we now take «= Vx 
w 


unit of length. When the oscillator 1s in the state described by 
the n“” energy level, the probability that the oscillating particle is 
at a distance x from its position of equilibrium is given by 
e~*/2 +? (x). These probabilities are to be understood as 
relative, and refer to equal infinitesimal intervals about the 
points of comparison x. In particular, for the lowest energy 
level » == 0 the probability density is e~*/?; we can therefore 
no longer say that the mass-point is at rest in the position of 
equilibrium, but rather the probability of tts displacement from 
this position is given by a Gauss error curve. The normalized 
characteristic functions of (3.3) are given by 


alt) = = b(t) 


On expressing any function (x) of position in terms of this set 


bx) ~ Sx halt = Jue dx, 


and the operator belonging to the energy H 1s, as we have already 
seen, expressed in terms of these co-ordinates f, by 


Xn —> h(n + 4) ° xX ,. 
In order to find the operator associated with the co-ordinate x 


we must express x,(x) linearly in terms of the characteristic 
functions themselves ; by (3.6) we have 


XP n = Pasi oe UP ny 


whence 
Yn NY n— ey ay a 
xb, = a Past = y Vn) a apn ik Wasa + Vn Pn 
n n 


The correspondence U(x) > x(x) is thus expressed in terms of 
these Fourier coefficients by 


yo po 
an > VN Xa + Vn + LXnya; 


60 QUANTUM THEORY 
its matrix |lq,,,,|| contains only the elements 


Qun-1—= VM", Inn = Vat). (3.11) 
(On returning to the original unit of length the right-hand side 
must. be multiplied by the factor «.) On applying the operator 
2 to d, we obtain, in accordance with (3.8) and (3.6), 


dx 
i, 
ff nba — dass 


whence 


aa = 1(/n b,y— Vn + ldbays). 


The linear Hermitian correspondence associated with the mo- 


mentum p = : ie is accordingly 


1 ax 


h _ Sect ceh Gs 
Xn > 53\— vn Xn-1 ee vn a 1 X ns) ) 


its matrix ||p,,|| has as its only non-vanishing elements those 
for whichm=n-+1: 


h jee 
Pa; n-1 — 5; V 1, Pn; n+1— 5 V0 + 1, (3.12) 


(On returning to the original unit of length these elements are 
to be multiplied by 1/a.—Terms with the index n — 1 are to 
be omitted when » = 0; in fact, they automatically drop out 
of the above formule.) 


§ 4. Spherical Harmonics 


In order to discuss the energy levels of an electron in a 
spherically symmetric clectrostatic field we must first discuss 
spherical harmonics and their principal properties. 

1. Definition.—Let r denote the distance from the origin in 
the 3-dimensional space with co-ordinates x, y, z, and let r, 6, ¢ 
be polar co-ordinates with polar axis along the positive 2 
direction : 

xtiwy=rsinbe'? zg=rcos 0. 


On setting a homogeneous polynomial u of 6 degree in x, y, 2 
equal to 7'- Y,, Y, depends only on the directional co-ordinates 
6, @ and is a function of position on the unit sphere. If u is 
a harmonic function, i.e. if it satisfies the equation Au = 0, 
Y, 1s said to be a surface harmonic of degree | and the harmonic 
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function u itself is said to be a spherical (or solid) harmonic of 
degree 1. Since in polar co-ordinates 


1 1 
Au == : (ns ) 4+- —An, 
or ee 


r? Or 
1 (0/. du ]  O*u 
Au = sh 5a(sin 858) + sana gah on 
the surface harmonic Y, satisfies the differential equation 
AY,+ 10+ 1)Y,=0. (4.2) 


2. Orthogonality.—On applying Green’s formula to the 
spherical harmonics u = 7*Y,, v= r'Y, on the interior of the 
unit sphere, we obtain the orthogonality relations 


{V¥,Vidw=0, k+l, (4.3) 


in which dw = sin 0d6@d¢ is the surface element on the unit sphere. 
Since the conjugate complex Y, of a surface harmonic is also a 


surface harmonic, the first factor in (4.3) can be replaced by Y,. 
3. Basis.—On writing 


the differential equation Au = 0 becomes 


2 
ey gee 07Uu 


don 7 et 
we see that a homogeneous polynomial u of degree / in &, n, 2 
breaks up into harmonic polynomials u™): 


us dul), (m= —b--sl—1,) 


where u'™ consists of all terms in which the exponents of € and 
n have the fixed difference m. The recursion formula for the 
coefficients of u'™, which is obtained from the differential 
equation Au = 0, further shows that there exists one, and to 
within a multiplicative constant only one, such harmonic wu! 
Accordingly, there exist exactly 2/-+ 1 linearly independent 
surface harmonics of degree 1; we may take them to be the 
Y™) defined by 


ui™) = r! ° uy. 
Writing 
Ul) == (x — ty)-™* P= (x + ty)" Py 
and r placing 
(x + ty)(x — ty) by P— 2h, 
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P and P, depend only on r* and z. Hence on taking r = 1 
we have 


yim) = e™? (sin 8)—™ » P\™ (cos 8). (4.4) 
For m= —I1 we take P=1, and for m=+1, Pi =1; 
P(z) = (1 — 2*)' for this latter case. Since Y depends on 
¢ only in the factor e*™?, 


[YPVOMda = 0, m’ +m. (4.5) 


This basis Y'™, in which the z-axis occupies a preferred position, 
is accordingly unitary-orthogonal. 

4. Completeness —That the totality of surface harmonics 
constitute a complete orthogonal system on the unit sphere can 
be proved by showing that any polynomial in x, y, 2 on the 
sphere can be written as a sum of surface harmonics. Now 
the general polynomial of degree / contains 


(I+ ++ (L—Y+++-41 
arbitrary ccnstants. But exactly this same number of linearly 


independent homogeneous polynomials are contained in the 
expression 


r(Yit Vie te + )l= ut (x2 + y? + 2)uye t+: +], (4.6) 


for the polynomials of the form r'Y,, r'Y,_9, + - + are linearly 
independent in virtue of the orthogonality of surface harmonics. 
r'Y, contains exactly 21 + 1 = (1+ 1) + / linearly independent 
functions, and consequently (4.6) contains exactly 


(+ )+4+(-)+0-AM+--, 
as asserted above. 


5. Closed expressions for the surface harmonics.—On_ sub- 
stituting (4.4) in (4.2) we obtain the differential equation 


d?P dP 
(1 — 2") 72 + 2(m — I)z + (7(2 + 1) — m(m — 1)]-P=0 


for the polynomial P = P™ in z= cos 8. From this equation 


dP, 
we find that — satisfies the same differential equation on re- 


az 
placing m by m — 1; we thus obtain the recursion formula 
adap™ 
(m~- 1) _ 
P (z) af 
and the expression 
a” 
POP(2) = (1 — 22)? 


~ dgk™ 
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In particular, the *‘ zonal harmonic ”’ 
d'(1 -- 3)! 
Pz) = P(2) = eS, 
6. Further formule.—- 
jxViY. dw = 0 (4.7) 


unless /—k=+1. For x-r*Y, is a polynomial of degree 
k + 1 and may, in accordance with 4, be expanded in the form 
reUVi i + Yn-1 +: °°). Consequently on the unit sphere 

aoa == Vas + ie -] -{- wee (4.8) 
and the only values of |! 22 for which the integral (4.7) can 
have a value other than 0 is /=k-+ 1. Hence our assertion 
(4.7); it also follows from the above that only the first two 
terms can appear in (4.8). 


Further, we shall also have occasion to use the differential 
expressions 


l/ du ou 
Lis (ys — a) Lyu, Lu, (4.9) 
Ltu = LL zu) + L,(Lyu) + L,(L,u) 


in terms of polar co-ordinates. On setting in 
Ou ou ou 
du = —d —d —az 
‘ ox ale oy yor 02 


the changes dx, dy, dz obtained by allowing ¢@ to increase by 
df and holding r, @ fixed, we obtain immediately 


Similarly, 
. ee .cos 8 2 
== e'?( ee eee 
aN tm (59 merry: 4) 
r) .cos 8 Q 
06 cae sin @ 53)” 
L?=—A _ [eq. (4.1)]. 


Te et — (4.10) 


§ 5. Electron in Spherically Symmetric Field. 
Directional Quantization 


Now back to physics! Consider an electron of charge — e 
revolving about a fixed nucleus of charge Ze situated at the 
origin. For Z= 1 we have the hydrogen atom, for Z = 2 
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singly ionized helium Het, for Z = 3 doubly ionized lithium 
2 

Lit*, etc. The potential energy is V = aot we shall, 

however, for the present take V(r) more generally as any function 

of the radius r. The wave equation for the determination of 

the energy levels is then 


Ap + [E— Vi) = 0. (6.1) 


On expanding in terms of surface harmonics yf becomes a sum 
of terms f,(r)Y, (J = 0, J], 2, +--+). The differential operator 
on the left-hand side of (5.1) sends the 7 term of this sum into 
Y, times 


slag") — Sal + Vein. (6) 


Consequently each individual term must satisfy the differential 
equation separately ; we thus obtain a complete set of char- 
acteristic functions of the form 


b = fi(r) Yr. 
The factor f,(r) depending only on r must be such that (5.2) 
vanishes and frPf(r)filrar converges. Denoting the char- 
acteristic numbers and characteristic functions of this differ- 
ential equation by 
Eny falr) (n=0,1,2,° °°), 


Ey, is a (214-1)-fold energy level, as the expression f,,(r)Y, 
contains 2/-++ 1 linearly independent characteristic functions 
associated with this single characteristic value ; we may choose 
as a basis the functions 


Va? = fut) YOP (m= — Lee 11,9) 


We thus arrive at three integral quantum numbers: the 
‘“vadial quantum number ” n, the “‘ azimuthal quantum number ” I, 
and the ‘‘ magnetic quantum number’’ m. The energy level 
depends only on the first two. 

In justification of this nomenclature we determine the angular 


momentum h& of the electron with components 
hL, = VPi— Py °° 


In quantum mechanics L,, L,, L, are the operators (4.9). 
Hence for 


yim) — fii(r) Vo — ei. (a function of r and 8) (5.3) 
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we have, in accordance with (4.10), 


Lip=m-y, 


and for the general characteristic function 


b= frlr)Y (5.4) 
with azimuthal quantum number / 
L?4=1(1-+ 1)-u. 
Hlence in the state described by (5.4) not only the energy has 


a definite value E,,, but also the absolute value of the moment 
of momentum 


| S2= 1 + 1) |. (5-5) 


The significance of the azimuthal number is that it fixes this 
magnitude. It is indced remarkable that there exist states 
l=0, n=0, 1, 2, +--+ with spherically symmetric character- 
istic functions W = f,o(r) for which the moment of momentum 
vanishes. In the states described by (5.3) not only the energy 
and the absolute value of the moment of momentum have 
definite values, but also the z-component of the moment of mo- 
mentum assumes a definite value with certainty, for then 


| L, <= m. (5.6) 
Since a magnetic dipole moment 
— eh 
8 = 5 
5) Duc = (5.7) 


is associated with the angular momentum hg of the revolving 
electron (the mass of the electron being denoted by uw whenever 
there is danger of confusion with the magnetic quantum number 
m), the influence of & will be felt on subjecting the atom to a 
magnetic field. The existence of the Zeeman effect under such 
conditions can be traced to this cause. A fundamental ex- 
periment to observe the magnetic moment of the electron directly 
is due to Stern and Gerlach. Let a stream of one-electron atoms, 
which are all moving in the direction of the x-axis and are in 
the state (n, /) with energy level £,,, be subjected to an in- 
homogeneous magnetic field in the direction of the z-axis. Let 
the x- and y-components of the magnetic field vanish in the 
(x-z)-plane, in which the beam moves, and let the z-component 
be a function of z alone. A magnetic dipole, the z-component 


' H 
of whose moment is s,, 1s then acted upon by a force eos 
in the positive z-direction. In consequence of (5.6) the atomic 
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beam should be broken up into 21+ 1 smaller beams by the 
force in the z-direction, corresponding to the various values 
m= l, /—1, +++, —l of the magnetic quantum number. 
On performing the experiment on silver atoms in the normal 
state two beams, corresponding to m= +1, were observed; 
the value of the ‘“ Bohr magneton,’ the elementary magnetic 
moment corresponding to one unit of angular momentum, was 


found to agree with the value ne obtained from (5.6) and (5.7). 


Why the unperturbed beam corresponding to m= 0 did not 
appear remained unexplained. 

The older quantum theory, which employed the quantum 
number k = /+ I with values 1, 2,-- -, allowed m to assume 
the integral values from — k to +k; it seemed plausible to 
exclude the case k& = 0, although one was thereby led into 
difficulties on applying the so-called ‘‘ adiabatic hypothesis ”’ 
to the behaviour of an atom under the influence of crossed 
electric and magnetic fields. In the new quantum theory no 
ad hoc hypothesis 1s required for this exclusion, as / can assume 
only the values 0, 1, 2,- + -. But according to either the old 
or the present scalar wave theory there should exist an odd 
number of permissible values of m for given k orl; the exclusion 
of the case m= 0 apparently required by the Stern-Gerlach 
experiment cannot be accounted for on cither theory. Nor 
can we explain the related fact that in the anomalous Zeeman 
effect m may assume either an even or an odd number of values, 
according to the nature of the atom under consideration. 
Obviously something is lacking in our present scalar wave 
theory as well as in the older formulation; we return to this 
point again in Chap. IV, §4. The older quantum theory 
described the situation met above as ‘directional quantiza- 
tton’’; since the absolute value of the moment of momentum 
was hk and the component along the z-axis was hm, it concluded 
that the magnetic axis of the atom could assume only positions 
described by the inclination 6 with the z-axis determined by 
the formula 


cos9=—- (m=0,+1,4+2,°°+, +8). 


ax] 3 


Thus in the case k = 1 we should expect only three possible 
orientations for the magnetic axis: parallel and anti-parallel 
to the field, which we have taken in the direction of the z-axis, 
and perpendicular thereto—unless we empirically exclude this 
latter possibility m = 0 because of the Stern-Gerlach experiment, 
in which case we have but two. In either case we find ourselves 
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faced with a serious dilemma, for the direction of the 2-axis is 
an arbitrary direction in space. In order to avoid this one 
then assumed that the quantization was due to the influence of 
the magnetic field, and consequently the preferred z-direction 
was interpreted physically as the direction of the magnetic field. 
But even so the difficulty 1s not avoided in the limiting case of 
vanishing magnetic field, for the directional quantization should 
be maintained in arbitrarily weak fields. Or stated more 
physically, the radiation mechanism required by the Stern- 
Gerlach effect for the orientation of the atoms, which were 
originally in random orientation and precessing about the 
Z-axis, requires about 10° times as long as the greatest time 
consistent with the observations. The stand taken by the new 
quantum theory on this point is fundamentally different. The 
possible states (7, /) of the atom are described by the functions 
w of the (2/ + 1)-dimensional linear family 


+l 
p= fall) Y= St fult) YP 
ma -- 

or by the vectors of a (2/ + 1)-dimensional space with com- 
ponents x, The s-component of the moment of momentum, as 
well as the component in any arbitrary direction, ts capable of 
assuming only the discrete values hm (m=, l—1, +++, — JU. 
But in a state in which the 2 component, for example, assumes 
the value hm with certainty there is only a certain probability 
“that any other component will assume a definite one of its 
possible values 4:0, A-(41),¢+°:, Ae (4. The name 
‘directional quantization " 1s hardly an appropriate description 
of this situation. ® 

When the electro-static central force satisfies the Coulomb law 
and originates in a nucleus of charge + Ze, the differential 
equation (5.2) for the “* radial characteristic function '’ f = f,,(7) 
becomes 

ar (i+ 1 2m 
(LED OE Dy) 4 (Er + Zetf = 0. 

The character of this equation is unchanged on going over to 
the new dependent variable v defined by rf = e7*" - uv: 


doy dv, { (a? + ~~ 4 2mZe? Ml + Dy = ‘ 


drt “dr he hy yi 


We choose « in such a way that the constant term in the co- 
efficient of v vanishes : 


h?a? = — ImE. (5.8) 
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We know from the general theory of linear differential equations 1° 
that there exist solutions of this equation in the neighbourhood 
of the (regular) singular point r= 0 in the form of a power 
series 


Cag" 


in which the exponent p» begins with a certain value po, which 
need not be an integer, and runs through the values fo, fy + J, 
ftp + 2, °° +. On substituting this power series into the equa- 
tion we find the recursion formula 


{w(u + 1) — U0 + Mjaaa = 2a,( oy = 


) (5.9) 


h? 


for the coefficients a,. In order that it be satisfied for p+ 1=pg 
(a, = 0, a,,, + 0) we must have 


Holo — 1) = Ul + 1). 
We thus have the two possibilities : 
Pp =l+1 or p= —l. 


Considering the first possibility and taking the coefficient @,,, 
of the lowest power as unity, all remaining coefficients can be 
obtained by successive applications of the recursion formula 
(5.9), as the denominator p(y + 1) — Ul -+ 1) never vanishes ; 
let the solution thus obtained be denoted by v. The second 
possibility does not lead to a solution, however, as the denomt- 
nator in the recursion formula for ~ =/ vanishes; the second 
solution of the differential equation can be obtained by quad- 
rature from the first and involves logarithmic terms. 

The power series for v breaks off if for a definite exponent 
= po +1 


Zme* 
Cp = he 
or 
Zme? 


In this case f 1s of the form 


er +!» (polynomial of degree x in 7) ; 


it is finite at ry = 0 and the integral 


[r°flr)f(r)dr (5.11) 
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exists, as is to be required. The corresponding characteristic 
numbers E are the energy levels; on writing m in place of 
n-+1-+ 1 and solving (5.8), (5.9) for & we find 


Z*me4 i 


ee Qh? nn? 


(5.12) 
The integer 2, the principal or total quantum number, is 
subject to the condition 2 >J/, There exist no other solutions 
for which the integral (5.11) converges. 

The energy levels depend only on the principal quantum 
number n; the terms for which 2 is a fixed number and 


= Q, 1, + + +, 2 — 1 coincide in a single degenerate term E,, 
of multiplicity 
n -l 
¥ (2+ 1) = nt 
1=0 


This theoretical result agrees with the empirical formule for the 
Balmer, Paschen, Lyman, etc., series. We find, in fact, the 
expression 


Zak me* 
ne? dah c 
v i 
for the terms measured in wave-numbers (~— = =—)}. The 
2nrco = Aach 


expression for the Rydberg constant R in terms of the fundamental 
constants of nature (the charge and the mass of the electron, the 
velocity of light and the elementary quantum of action) agrees 
numerically with its empirical value. All terms and therefore 
all actual line frequencies vy depend on the integer Z describing 
the charge on the nucleus in such a way that Vv increases in 
proportion with Z. Since the X-ray terms are due to the inner- 
most electrons, which are but shghtly atfected by the outer 
ones, we should expect to find that the hardest X-ray lines, 
arranged in accordance with the atomic number Z, follow this 
law. It was discovered by Moseley and gave a conclusive proof 
of the fact that on going through the elements of the periodic table 
the charge on the nucleus increases by e from element to element. 
This law uncovers with unerring certainty the holes yet re- 
maining in the system of known elements; at present we lack 
but 2 (or 3) elements in the scrics beginning with hydrogen, 
Z == 1, and ending with uranium, Z = 92. 

The characteristic functions associated with these energy 
levels, which determine the relative probabilities of the various 
positions of the electron, can be expressed in closed form in 
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terms of the so-called Laguerre polynomials. The character- 
istic function belonging to the normal state n= 1, /= 0, 1s 
spherically symmetric: * 


1 
ee 5.13 
for hydrogen 
2 . 
gobo (5.14) 
me 


(According to the older Bohr theory, a is the radius of the inner- 
most electronic orbit.) a@ determines the order of magnitude 
of atomic dimensions. In the normal state hydrogen possesses 
spherical symmetry (according to the scalar wave theory—but 
see Chap. IV, § 8). 

The radial characteristic functions 7° f,,(7) do not, however, 
constitute a complete orthogonal system for a given 7 for the 
full domain which we wish to consider: in addition to the 
discrete term spectrum (5.12) we have the continuous spectrum 
covering the whole region E =O. We go no further into this 
matter.?! 


§ 6. Collision Phenomena 


The optical phenomena show that the quantum theory leads 
to the correct energy levels, but they do not lend themselves 
to an attempt to interpret the vector ~% in system space as a 
probability. Collision phenomena, which deal with the de- 
flection of electrons or a-particles under the influence of other 
material bodies, are best suited for this latter purpose. The 
fundamental experiments of Franck and Hertz, as well as those 
of Davisson and Germer, belong to this latter category. 

Neglecting the reaction of the moving particle on the per- 
turbing body, the potential energy due to this latter may be 
taken as a given function V (xyz) of position. Considering 
a one-dimensional problem, the energy of the moving particle is 
then 


] 


We can think of the curve y = V(x) as the contour of a hill 
against which the particle runs. The wave equation for a 


* The normalizing factor 1/ 7a’ is calculated from 
we) 


J [ Jer2riadxdyas = 4n{e-2rlartdr == Wa’, 
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state with given energy & 1s 


dp. on 
ie phe [E — V(x)] p= 0. (6.1) 


If we neglect for the moment the perturbing field V we obtain 
as solutions of (6.1) the familiar de Broglie waves: »& is a linear 
combination of the waves e'** and e~'** proceeding in the positive 
and negative directions along the x-axis, the wave number a 
of which is determined by 


(ha)? = 2mls or ha = p. 


Writing 
2m 
pe V(x) = 1) (x) 
equation (6.1) becomes 
dp : 
ea A (eae oe 
Te 4- [a U(x)] fb == 0. (6.2) 


We now assume that as x -» + oo, U(x) behaves in such a way 
t- OO 


that the integral | [U(x)|dx converges; equation (6.2) then has 


— a 
one solution which behaves for x -» + oc asymptotically like 
e'** and another, which is linearly independent of the first, 
which behaves like e7'#7 in the same region. 
This can most readily be seen by solving (6.2) by the method 
of successive approximations. Let 


b= pti + peters: (6.3) 


and take as the 0“ approximation the function e’*; in general 
ta,, 18 determined in terms of #, by integrating the equation 


dfs, , | 
~ + oe? Wasa 7, U(x) pp. 


Hence 
(0 @) 


fn ya(X) == >- _fsina(s -— §) * UE) Wal) a€. (6.4) 


z 


We restrict ourselves for the moment to a region x = x such 
that 


~{|UGa)iax =p <i. 


To 
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If [y,,(x) | Sa, for all x, the integral (6.4) converges and we have 


Musil) San > 2[|U(E)|de 


we can therefore take ay = 1, a,,, = ga,. Then a, = g" or 
lb,(x)| Sg" for x 2X. 
Consequently the series for y converges at least as fast as the 
geometric series with ratio g. It satisfies the integral equation 
oO 
B(x) — pole) = — ~[sinale— 8) - UG) YE de (6.5 


and is consequently a solution of (6.2). Since 


W(ix)| Sl+g+e74+-- ag 


(6.5) leads to the estimate 


Me) — wl S Ey |U(E) dé, 


from which it follows that g(x) behaves asymptotically for 
x—>-+ oo like p(x) =e". Not only is p~ yy, but also 


- ~ a) for the equation 
oes oe | cosa(x — £) - UE E)aE 


M9 
gives as an upper bound for the absolute value of the difference 
on the left-hand side the quantity 

0 6) 
—; + five ae 
1 —g | 
x 

which approaches 0 as x — + o. 

The solution P(x) which we have found in the region x =: x9 
can naturally be extended over the entire real axis by analytic 
continuation. Since our considerations apply just as well for 
x—> — oo, we know that (x) satisfies an asymptotic equation 
of the form 


h(x) ~ ber + ble" for x-—> —- ow. 
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1e same time we must also have 


ya ~ ta(bere® — be 9), 


ax 


(x) being a solution of the differential equation, (x) is 


d? d2 
Oy + [at ~ U(x)Wb = 0, OY + [at -- U(x) = 0 


iply the first equation by #, the second by # and subtract ; 
nd 

ad, dip dip 

ae gta 


=~ ~= const. (6.6) 
determinant (6.6) has the limiting value 27a for x - + oo 
or x-> — © 

2ia(bb — b’b’), 
ce 

bb — b'b’ = 1. (6.7) 
lows from this that & + 0. On multiplying g%(x) by 1) 


ave a solution % whose asymptotic behaviour 1s described 
le equations 


Yiv) ~ er -+4- ale ** for x-> — oo, 
bv) ~ ae’?* for v—-> + x (5.8) 
ea -:1b,a' == b''b. (6.7) 1s now 
ae a se (6.9) 


| particle of defintle energy runs against the potential energy 
from the left, 106. from x - 00, Whereas in classical 
antics the particle certainly etther gets over the hill or is thrown 
according to whether its initial kinetic energy ts greater or 
than the maximum of V(x), quantum mechanics states that 
is a probability ja\® that it gets over and a probability ja’\? 
itoas thrown back. Furthermore, these probabilities are 
nuous functions of the energy of the particle; the dis- 
nuity of the classical theory is completely broken down. 
» perform the experiment successively with a large number 
articles we find that they are divided into two streams, 
cordance with (6.8.), proceeding in the positive and negative 
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directions along the x-axis; the relative intensities of these 
are given by 1 and |a’|? for x > — oo, respectively, while for 
x» -+ oo there exists only the positive stream of intensity 
|a|2. Equation (7.5) thus expresses the conservation of the 
number of particles and shows that we must consider the square 
|a|? of the absolute value of the amplitude a as a relative intensity 
or probability. 
If the integral 


the solution # is represented throughout the whole space by the 
formula (6.3). In perturbation theory one is usually satisfied 
with the first term y%,. The theory of the familiar experiments 
of Rutherford, in which a-particles are allowed to fly in a given 
direction with given momentum into and be deflected by the 
field of an atom, has been developed by Wentzel in a similar 
manner.}2 The influence of the «-particle on the atom is thereby 
neglected ; on taking it into account we are led to the theory 
of the experiments of Franck and Hertz, giving formule for 
the dispersed particles specified according to their various 
discrete kinetic energies and their various directions. This 
calculation has been carried through for hydrogen by Born and 
Elsasser.8 A very important application of this picture of 
corpuscular waves ‘‘ seeping ’’ through a potential hill has been 
made by G. Gamow and R. W. Gurney and E. U. Condon to 
explain radioactive decay." 


§ 7. The Conceptual Structure of Quantum Mechanics 


The fruitfulness of the theory has been amply established by 
the above applications and the examples given have served to 
illustrate its physical interpretation ; it now seems time to set 
forth its general abstract formulation. 

Consider a physical system of known constitution. [¢ as 
particular state, each individual case of such a system 15 7 oy, 
sented by a vector x of modulus 1 in a unitary system space. ir, Ah 
physical quantity associated with the system 1s represenle by an 
Hermitian form in this space. The fundamental ee which 
we put to the theory is not, as in classical physics, ‘‘ What value 
has this physical quantity in this particular case?” but rather 
What are the possible values of the physical quantity A, and wha; 
1s the probability that it assumes a definite one of these values ;,, 
a given case ?’’ The answer to this question is: /he Probability 
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that A assumes the value « ts the value Ea(r) of the characteristic 
form Ea of A associated with the value a, where the vector ¢ repre- 
sents the case in question and the quantity A is represented by 
the Hermitian form A in the system space. The quantity repre- 
sented by A 1s capable of assuming only those values a which 


are characteristic values of the form A. In accordance with the 
equations 


= Ea (t), Alt) = Laks) 


the sum of the probabilities is 1 and the value A(z) of the form 
A is the mean value or expectation of the quantity A in the State X. 
Since all assertions concerning the probabilities in a given state 
LE are numerically unaltered when x is replaced by € x, where € 
is an arbitrary complex number of modulus 1, we cannot dis- 
tinguish between these two cases. The pure case or state 1s 
consequently more properly represented by the ray xr than by 
the vector x, and we must therefore operate in the ray field in 
system space rather than in the vector field. 

The significance of probabilities for experimental science is 
that they determine the relative frequency of occurrence in a 
series of repeated observations. According to classical physics it 
is in principle possible to create conditions under which every 
quantity associated with a given physical system assumes an 
arbitrarily sharply defined value which is exactly reproducible 
whenever these conditions are the same. Quantum physics 
dentes this possibility. We illustrate this by the example of 
directional quantization. We know conditions under which we 
can guarantee with practical certainty that the atoms of a 
hydrogen gas are in the normal state. Let us therefore assume 
that we can create conditions under which we can be certain 
that the atoms under observation are in the quantum state (7, @) 
with azimuthal quantum number /== 1 and energy E. A 
certain quantity L,, which can, under these conditions, assume 
only the values + 1, 0, or — 1 1s associated with each direction 
gin space. Steri and Gerlach have shown us how to sharpen 
these conditions so that L, takes on a definite one of these values, 
say L,== +1. According to the theory the utmost limit of 
precision is then reached. If x is another direction in space, 
then under these conditions which determine L, and E only the 
relative probability that the quantity L, assumes any one of the 
values + 1, 0, — 1 can be given. Why is it impossible to go 
further and insure conditions under which in addition L, takes 
on a definite one of the values, say 0, with certainty ? Because 
the ‘‘ measurement "’ of L,, which is accomplished by separating 
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the atoms into three classes L, = + 1, 0, — 1, is only possible 
by creating conditions which destroy the homogeneity already 
existing with respect to L,. Polarization of photons is obviously 
somewhat analogous to directional quantization of atoms. The 
conditions for the production of a monochromatic beam of light 
in a definite direction determine the energy and momentum of 
the photons. To each orientation s of a Nicol prism corre- 
sponds a definite quantity A, which is capable of assuming only 
the values +1; if A,= +1 the light goes through and if 
A, = — lit does not. With the aid of such a prism we separate 
out the photons for which A, = 1 without disturbing their 
energy and momentum. The utmost limit of precision is then 
reached ; a monochromatic pencil of polarized light is the most 
homogeneous light possible. If we now place a second Nicol 
of orientation o in the path of this beam, then naturally only 
those photons which have A, = + 1 can pass through. But 
the light which we thus obtain is of the same constitution as 
if the first Nicol of orientation s were not used at all: the con- 
dition that all the photons have A, = + 1 is obviously destroyed 
by the second Nicol. 

Natural science is of a constructive character. The concepts 
with which it deals are not qualities or attributes which can 
be obtained from the objective world by direct cognition. They 
can only be determined by an indirect methodology, by observing 
their reaction with other bodies, and their implicit definition is 
consequently conditioned by definite laws of nature governing 
reactions.’® Consider, for example, the introduction of the 
Galilean concept of mass, which essentially amounts to the 
following indirect definition: ‘ Every body possesses a mo- 
mentum, that is, a vector mb having the same direction as its 
velocity »; the scalar factor m is called its mass. The mo- 
mentum of a closed system is conserved, that is, the sum of the 
momenta of a number of reacting bodies is the same before 
the reaction as after it.’ On applying this law to the observed 
collision phenomena data are obtainable which allow a deter- 
mination of the relative masses of the various bodies. But 
scientists have long held the opinion that such constructive 
concepts were nevertheless intrinsic attributes of the ‘‘ Ding an 
sich,”” even when the manipulations necessary for their deter- 
mination were not carried out. In quantum theory we are con- 
fronted with a fundamental limitation to this metaphysical stand- 
point.*® 

We have already seen, toward the beginning of this chapter, 
that a co-ordinate x and its associated momentum p stand in 
a peculiar relationship to one another: the precise determina- 
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tion of either one of these quantities precludes the precise 
determination of the other. In the state represented by the 


wave function (x) ) [fe pb dx = 1 the mean values x) = <x> and 


Py = “p> are given by 


00 ce ib 
[x Bx) y x)dx and ro eae. 


No loss of generality is incurred by taking these mean values 
as zero; the first can be made to vanish by replacing x by 
x — xX» or w(x) by P(x + x) and the second by replacing (x) 


by e( — be) w(x). The mean values (Ax)?, (Ap)? of (x — x,)?, 
(p — fo)? are then given by 


(Ax)? = fx2B(xip(eiaa 


+ @ 


dtp b dys 


L a 
From these expressions the general inequality 
Ap: Ax .> 3h 


can readily be obtained (I am indebted to W. Pauli for this 
remark); the less the uncertainty in x, the greater the un- 
certainty in p, and conversely. * 

In general the conditions under which an experiment ts 
performed will not even guarantee that all the individuals con- 
stituting the system under observation are in the same “ state,” 
as represented in the quantum theory by a ray in system space. 
This is, for example, the case when we only take care that all 
the atoms are in the quantum state (7, 2) without undertaking 
to separate them, with respect to m by means of the Stern- 
Gerlach effect. In order to apply quantum mechanics it 1s 
therefore necessary to set up a criterion which will enable us to 
determine whether the given conditions are sufficient to insure 
such a “ pure state.’ We say that the conditions @’ effect 
a greater homogeneity than the conditions © if (1) every quantity 
which has a sharp, reproducible value under € has the same definite 


*Cf, Appendix 1 at the end of the book. 
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value under @’ and if (2) there exists a quantity which is strictly 
determinate under ©’ but not under ©. The desired criterion 
is obviously this: The conditions © guarantee a pure state if it 
1s impossible to produce a further increase in homogeneity. (This 
maximum of homogeneity was obtained in classical physics 
only when all quantities associated with the system had definite 
values.) 

In the pure state represented by the vector a = (a,), a quan- 
tity Q represented by the Hermitian matrix Q = '!q,,|| has the 
expectation or mean value 


Q> = LG iFir- 
The numbers | 
Aik = a,an (7.1) 


are the components of a positive definite Hermitian form A of 
trace 1, 1.e. 


(ax) |? oe 24,0; : AK), 


(Positive definite is to be understood here in the weakened 
sense A(t) 20.) It is to be noted that (Q» depends linearly 
and homogeneously on the quantity |!q,,|| under consideration : 


Q = tr (AQ). (7.2) 


If a statistical aggregate A is created by subjecting a large number 
of individuals of the physical system under observation to the 
conditions ©, then the mean value of a physical quantity Q 
will be given by (7.2) where A is a certain positive definite 
Hermitian form of trace 1 which is characteristic for the 
aggregate—even if the conditions © do not guarantee maximum 
homogeneity. The reason for this is that (7.2) is still correct 
if we mix statistical aggregates, each of which does possess 
maximum homogeneity, in any proportions; any statistical 
case may indeed be considered as a mixture of pure states. 
As F. v. Neumann has remarked, this formula (7.2) can be derived 
from the simple axioms !" : 

1. If P, Q are physical quantities and A a real number, then 
APD = ACP? <P + Q> = CP? + <Q). 

2. If the quantity Q is capable of assuming only positive 
values (i.e. if the form Q is positive definite), then <Q, = 0. 

3. If Q is a pure number, i.e. if it is independent of all 
physical conditions, then <Q> = Q. 

Assuming not only that any physical quantity Q 1s repre- 
sented by an Hermitian form, but also that conversely any 
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Hermitian form represents some quantity associated with the 
system, it follows from (1) that 


Q> = 2 FeJie 
i, 


where the coefficients a,, are independent of Q. (We shall 
return to this assumption in Chap. IV, § 9.) The matrix 
A = 'la,,|| must be Hermitian since <Q> is always real. On 
bringing A into the normal form La,x,Z,; (2) requires for the special 
Hermitian forms of the type Q = 24,x,z, that 2a,q; 2 0 for 
arbitrary non-negative values qg;; consequently a, 20 and A 
is positive definite. 

The probability that in the statistical aggregate A the quan- 
tity Q assumes the value « 1s 


w= tr (AE,) (7.3) 


where E, is the idempotent form associated with the character- 
istic number kx, 

We can also distinguish “ pure states’’ among general sta- 
tistical aggregates, “* mixed states,”’ by the fact that they cannot 
be obtained by mixing two or more different statistical aggregates. 
This corresponds to the theorem that an Hermitian matrix A of 
the form (7.1) 1s not expressible as the sum B + C of two positive 
definite Hermitian forms B and C which are not merely multiples 
of A. This can be readily proved on taking the vector a = (a,) 
as one of the co-ordinate axes in system space. The positive 
definite Hermitian forms A with unit trace, te. the statistical 
aggregates, constitute a convex region © in the sense that with 
A and B their ‘‘ centre of mass’ AA + pB (A, p arbitrary positive 
numbers whose sum is unity) belongs also to ©. A point of © 
which cannot be considered as such a centre of mass of two 
points of © distinct from the point in question ts called, following 
Minkowski, an ‘‘ extreme potnt.’'® © 1s the ‘convex core’’ of 
the class © of all extreme points, 1.e. 1t 1s the smallest convex 
domain which includes all the points of ©. We cannot dispense 
with a single extreme point of ©; if we leave out but a single 
point of © the entire convex core shrinks together. We may 
accordingly characterize the pure states as the ‘‘ extremes '’ among 
all the possible statistical aggregates. 

It is often convenient to dispense with the normalization 
tr A = 1; (7.3) then gives the relative rather than the absolute 
probabilities. The simplest statistical aggregate is that one 
characterized by the unit Hermitian form with matnx 1; it 
represents total ignorance. In thermo-dynamics the important 
réle is played by the canonical aggregate A = e7¥i*®. FH is here 
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the Hermitian form which represents the energy, k the Boltzmann 
constant and the number @ the temperature. !® 


§ 8. The Dynamical Law. Transition Probabilities 


Having considered the general probability laws of the quantum 
theory, we now turn to the dynamical law governing the change 
in the state x of a physical system during an interval dt of time. 
The dynamical law states that this change is effected by 
the infinitesimal unitary operator — a ‘ff, where H 1s the 


Hermitian form which represents the energy : 


(8.1) 


The peculiar significance of the energy in quantum mechanics 
is due to its appearance in the dynamical law. We also consider 
this law as a fundamental axiom of quantum theory of universal 
validity. For the matrix X : 


Nik al X Lp, 


which characterizes a statistical aggregate of the pure state 
described by the vector x = (x,) [cf. eq. (7.2)], we obtain the 
equation 
hdx ; P 
os NH — HX (8.2) 
on applying (8.1) and taking into account the fact the H is 
Hermitian. This same equation also governs the change in 
time of a statistical aggregate X for a mixed state.?° 
For the integration of (8.1) it 1s convenient to choose as our 
co-ordinate system the characteristic vectors of H; the corre- 
sponding characteristic numbers £, are the energy levels. We 
call this particular system the Heisenberg co-ordinate 
system, as Heisenberg tacitly employed it in his fundamental 
paper on quantum mechanics. This Heisenberg co-ordinate 
system is in general not uniquely determined; the essential 
point is the decomposition of the system space ® into the 
characteristic sub-spaces ft = R(4’), RR’ = R(L"), - + - as- 
sociated with the various characteristic numbers &’, £’’, «+ >. 
The states represented by vectors x in such a characteristic 
space are called quantum or stationary states; in them the 
energy has a sharply defined value. The cases in which H 
possesses only discrete characteristic numbers include ‘‘ con- 
ditionally periodic motion,’’ the only ones for which the older 
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quantum theory could be formulated. The nomenclature and 
symbolism employed in the following 1s adapted to discrete char- 
acteristic spectra, but this by no means precludes the possibility 
that the spectrum is entirely or partly continuous. Equation 
(8.1) becomes, on resolving it into components with respect to 
Heisenberg’s co-ordinate system, 
hdx, 
ae et Bex == 0 
1 dt a7 nevn 
and has as solution 
Xq(t) =x, ent (EE, = hy,). (8.3) 
This is an explicit formulation of the unitary transformation 
r—> z(t) = U(t)r which the state vector r undergoes in time ¢. 
Since |x,(¢)|?is constant, the probabilities for the various energy 
values do not change in the course of time. The finite law 


A(t) = U(t)XU7 (8) (8.4) 
for the dependence of the statistical state X(t) on the time ¢ 
is fully equivalent to the differential law (8.2). 


The mean value g = q(t) of the physical quantity represented 
by the fixed Hermitian operator Q: 


g(t) = tr [X(t) + Q] 


can, on taking into account the symmetry properties of the 
trace, be written also in the form 


q(t) = tr [X + Q(t] 
Q(t) = U~*(t)QU(t). (8.5) 


Consequently the situation can be described either by con- 
sidering Q as fixed for all time and the statistical state X(t) as 
varying with the time in accordance with the law (8.4)—and 
this is the fundamental stand taken by quantum mechanics— 
or we can take the initial state X as representing the state of 
the system for all time and allow the operator Q(t) representing 
the quantity Q to vary with time in accordance with the law 
(8.5). This latter interpretation lends itself to comparison with 
classical mechanics. (8.5) is equivalent to the differential law 

— = HO — QOH, (8.6) 


for in virtue of (8.2) and (8.6) 


ft ae (AX 0) = te (x40) 


where 
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In particular, the quantity Q 1s constant in time, 1.e. the prob- 
abilities associated with it do not change in course of time, if the 
Hermitian form Q which represents 1t commutes with H. 

In Heisenberg’s co-ordinate system equation (8.5) becomes 


Ymn(t) -F Amn : e— en — vnlt (8.7) 
The matrix Q(t) is thus expressed in terms of components per- 
forming simple oscillations with frequencies vy — v,a. The 
corresponding amplitude is qmn. On going over from the m'" 
to the nt" stationary state the system loses an amount h(v,, — v,) 
of energy; if this energy is radiated as light, its frequency 
is given by 


Vi Ye ee (8.8) 
Classical mechanics collects together all the transitions from 
a fixed level m to all possible levels n = 1, 2, + - - into a single 


state of motion, the motion of the system in the m quantum 
state, whose harmonic components have the corresponding 
transition frequencieS vm, Vme, °° * For any quantity A it 
therefore associates a constant amplitude a,,, with the transition 
m-—»n. But in classical mechanics (for systems with one degree 
of freedom) we have 
Vmn = R*+ w(n), = mM — Nn, 

instead of equation (8.8). On multiplying the two Fourier 
series A, B 

Da, eh and Sb, + eke 

k k 


we obtain the Fourier series C with coefficients 
Ch Dab, (r+s5=k). 


Accordingly classical mechanics associates with the quantity 


C = AB the amplitudes 


Cin = 2 Om, mer Cas wes (r +s=m— n), (8.9) 
whereas quantum mechanics assigns to it the amplitudes 
Cmn = ZA mt tn = S'am, m-r' Donets n° (8.10) 
r 


The difference between these two results lies‘in the fact that in 
(8.9) both factors a, b have the first index m in common, whereas 
in (8.10) the first index of b is the same as the last index of a. 
This is in exact analogy with the difference between the ‘‘ classical "’ 
and the correct Ritz-Rydberg combination principle. This was 
Heisenberg’s starting-point ; the correct combination principle 
indicates the pertinent-fact that the rule (8.9) for the multi- 
plication of amplitudes must be replaced by (8.10). Admittedly 
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such multiplication is not commutative, and it collects together 
amplitudes which the older model assigned to different orbits. 
We denote |an,_|2 as the intensity of the quantity A in the 
transition m—>n. When multiple energy levels occur (‘‘ de- 
generacy'’) only the sum J’ |amn|*?, extended over all indices 
m for which /,, = EF’ and all indices » for which EF, = EB”, 
has an invariantive significance; in such a case this sum ts 
taken as the intensity of A in the transition i’ > EE". If A, 
is that portion of A in which R(/:’) intersects R(L") the sum 


— 


defined above ts the trace of A,A,. 

Consider an atom with one or more electrons and let rt be 
the vector from the nucleus to a representative electron. Then 
q = et, or in case there is more than one electron the sum 
q= yet, extended over the various electrons, 1s the electric 
dipole moment of the atom. In classical electrodynamics the 
intensity of the light of frequency v emitted by the atom is calculated 
from the amplitude q(v) of the harmonic components of q with 
the same frequency v in the following manner.t The rate at 
which energy flows through a surface element do at the point P, 
whose distance from the atom at O is large compared with the 
wave-length, 1s given by 


Iq°(v) |? ° sale, 


_> 

where q* is the component of q perpendicular to OP and dw is 
the solid angle subtended at O by dv. We have further assumed 
that the wave-length under consideration is large compared with 
the radius of the atom. Since each photon of frequency »v 
carries with it energy hv, we postulate that this law is to be 
taken over into quantum theory as follows: the probability 
that an atom in state m goes over into state »’ in unit time and 
emits a photon of frequency v, whose direction lies within the 
solid angle dw, is given by 


yp3 


al: (8.11) 


i 2, 
Qn’ | 


We thus arrive ata definite rule for the calculation of the intensities 
of the lines emitted by the atom. The fact that we can now make 
such a prediction indicates a distinct superiority of the new 
theory over the old. Jn particular, the transition n—+n' does 
not occur if the corresponding coefficient in the Hermitian form 


+ By this we mean that the terms q(vjef’? + g(v)e-f! occur in the harmonic 
analysis of 4. 
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for q ts zero. This constitutes the general selection rule. The 
connection between the state of polarization of the emitted 
light and the direction of oscillation of the electric moment is 
also carried over into quantum theory. But a real derivation 
of our intensity rule can naturally only be obtained by con- 
sidering the question of interaction between the atom and the 
ether; sec § 13. 


Examples: 1. The Oscillator. 
The Hermitian form 


x P(x) P(x) dx, 


representing the co-ordinate x of the oscillating particle has, 
as we have already found [(3.11)], the coefficients 


Gan = 0 if n’ -n+1; | 
—,| == fe +1) (8.12 
Qn, n—-) ~~ Ea Gn, n+1 ~~ (ee a a Poe | 
2m) 2mw 


with respect to Heisenberg’s co-ordinate system, in which the 
energy is referred to its principal axes. We thus obtain the 
selection rulen—+>n-+1; the quantum number n can only change 
by + 1, the oscillator then absorbing or emitting a photon of fre- 
quency v=w and energy hw, tn accordance with (3.10). The 
selection rule makes it clear why no higher harmonics are ex- 
cited in the simple oscillator. We have also found that the 
matrix \|Pan'||, which represents the linear momentum in Heisen- 
berg’s co-ordinate system, is given by (3.12) 


_ _ 1 [hmwn == femoln + 1) 
Pun-1 = 1 ’ Pn, n-1 = | “= (8.13) 


2 
Pav =90 for n +$n+1 
2. Electron in spherically symmetric field. 
The result (4.7) for surface harmonics yields the selection rule 


[+l+41]1 (8.14) 


for the azimuthal quantum number |; for 1 = 0 only the transition 
0 — lis possible. On introducing the magnetic quantum number 
m as in §4, the characteristic functions #”) depend on the 
meridian angle ¢ about the z-axis only in the multiplicative 
factor e'™* ; here 


xtiy=rsin 0-e*¢, g=rcos 0. 
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In order to obtain the dependence of the matrices q, + iqy, 
qx — 14y, Jz On the transition m-—>»m’ we must evaluate the 
integral 

an 


fe(ap) e(— mg) e(m'$) dé, 


0 


where a=1, —1, 0, respectively. The integral vanishes 
unless m’ + a= m. The only components of q. + 1q, which do 
not vanish are those corresponding to the transitions m->m — 1 
in which the magnetic quantum number decreases by 1; _ for 
Gz ~ 14y,m>m-+1; forq:,m—>m. 

This last selection rule cannot be obtained from the spectra 
themselves as long as the terms corresponding to different 
values of m (|| Sl) coincide. But these terms are broken 
up into their various components by a homogeneous magnetic 
field in the direction of the z-axis (Zeeman effect). On “ longi- 
tudinal’ observation of the light emitted in the z-direction we 
find instead of the one line (1, 1) > (n’, l’) several left- and right- 
circularly polarized components, the former of which arise from 
the transitions m-—» m — 1 and the latter from m—>m-4+ 1. 
On “ transverse '’ observation, e.g. along the y-axis, we find 
two transverse linearly polarized lines arising from m—>m +1, 
and in addition a longitudinally (1.e. along the z-axis) polarized 
line corresponding to the transition m-—»m. (Polarization as 
here used means the direction of oscillation of the electric dipole, 
and therefore the direction of the electric field strength. ) 

In the term spectrum of the alkali elements, which 1s, however, 
typical in this respect, even for the more complicated spectra 
of the other elements, we distinguish between several series by 
means of the letters s, p, d, f, g,° °°. Each series consists of 
infinitely many terms which we number in the direction of 
increasing frequency by the integer 2. It 1s found convenient 
to let 2 run from 1 on in the s-series, from 2 on in the p-sertes, 
from 3 in the d-series, etc. The values of the terms 2s, np, 
nd, + + are then given by the “ hydrogen-like "’ formula 


RR 
(mn + x)?’ 


in which k = Ky, Kp, Ka, * * * 1S a correction term depending but 
slightly on », the numerical value of which but rarely exceeds 
1/2 and is very close to 0 for high series (f,g,...). Only terms 
lying in neighbouring series combine to produce a line, i.e. an 
s-term combines only with a p-term, p only with s and d, d with 
p and f, etc. In particular, the transitions np — 1s give rise 


86 QUANTUM THEORY 


to the principal series, which also appears in absorption, nd —> 2p 
to the lines of the diffuse series, ns > 2p to the sharp series, 
and nf — 3d to the Bergmann series.*! 

The alkalies A are univalent, i.e. in chemical reactions only 
one electron, the valence electron, plays a role; the others, 
together with the nucleus, constitute an inert closed shell. It 
is therefore reasonable to assume that the optical spectra of 
the alkalies are caused by quantum jumps involving only this 
valence electron, while the core At remains in its normal state. 
We have seen above that hydrogen in the normal state ts re- 
presented by a spherically symmetric wave function #; we 
therefore assume, disregarding the reaction of the valence 
electron on the core, that this feature of the core being ‘‘ closed ”’ 
is to be expressed by ascribing spherical symmetry to it.* We 
have then to deal with the problem of an electron in a spherically 
symmetric field, which we have already discussed above. In 
accordance with the empirical combination principle and the 
theoretical selection rule for the azimuthal quantum number J, 
the s, p, d, f,* + + terms are to be taken as having / = 0, 1, 2, 3, 
- + + respectively. 2 then runs from / + 1 on in the series with 
azimuthal quantum number J, as in hydrogen.** 


§ 9. Perturbation Theory 


The problem with which perturbation theory 1s concerned 1s 
the following : Let the energy H consist of two terms H==H-+-eW, 
the second of which, the perturbation term eW, is small compared 
with the first ; this we express by the “ infinitesimal ’’ numerical 
constant ¢, of which powers higher than the first are to be 
neglected. Assume that the quantum problem for the ‘ un- 
perturbed system ”’ with energy H has already been solved, so 
that the Hermitian form H has already been brought into 
normal (diagonal) form, and let ®’, R’, + + + be the character- 
istic spaces of H with characteristic numbers f’, E”’,+ + +. The 
problem is to find the solution of the equations for the ‘ per- 
turbed system ”’ with energy H. 

In order to illustrate the typical difference between degenerate 
and non-degenerate systems we first consider the system space as 
2- instead of oo-dimensional; then 


0 E, + EW, 


* Why He and not H is the first closed atom is only to be understood as 
the result of a profound modification of wave mechanics ; see Chap. IV. 

** Concerning the introduction of the ‘‘true quantum number”’ for 
elements other than hydrogen, see Chap. IV, § ro. 
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If &, += E, the unitary transformation which brings H into 
diagonal form differs from the identity only by terms of order e. 
Consequently the probabilities |x,|?, |x,|* that in the pure state x 
H has the values £,, E, will change only by amounts of the 
same relative order €; they remain constant to the same ap- 
proximation with which eW may be neglected in comparison 
with H. But the situation is quite different for degenerate 
systems, for which E, = E, = E&, for the principal axes of H 
are then indeterminate and this arbitrariness is expressed in 
the ‘‘instability ’’ of the system under the influence of a per- 
turbation. We set up that normal co-ordinate system e,’, e,’ 
in which W assumes the diagonal form; the co-ordinate vectors 
are then also characteristic vectors of H, since F, = E,. But 
these vectors can obviously differ arbitrarily from the original 
co-ordinate vectors €}, @,, whereas the energies hv,’, hv,’ can only 
differ from E by a term of order e. On returning to the original 
co-ordinate system we have 


X= Ay, ° e(— vt) + Ayn’ e(— vet), 
X_ == Ag, C(— vy t) + Age * e(— vet), 


where Q, = (@,;, @e1), Q_ = (@,9, Gog) are two mutually per- 
pendicular vectors whose directions coincide with those of e,’, e,’. 
The probabilities for the two states e,, @, vary periodically in 
time with the small beat frequency v,’ — v,’ (resonance between 
states @,, C2). Quantum states with the same energy are therefore 
in resonance with one another. The magnitudes of the components 
of yin the characteristic spaces R’, RR", + + +, 1.e. the probabilities 
for the various numerically different values of H remain ap- 
proximately constant under a small perturbation, but this is 
not the case for the absolute values a of the individual com- 
ponents x, resolved along the axes of an arbitrary Heisenberg 
co-ordinate system of the unperturbed system. 

In accordance with the foregoing we can formulate the 
perturbation problem in two forms: I. Determine the change, 
due to the perturbation, in those states in which the energy 
H of the unperturbed system is determinate. This formulation 
has a sound physical interpretation if we consider the perturba- 
tion as acting during a time interval ¢t,, f,. We then find how the 
probabilities for the various quantum States change under the 
influence of the perturbation.** I]. Determine the quantum 
states and energy levels of the perturbed system, 1.e. the char- 
acteristic values and characteristic spaces of H. We ask in 
particular how the terms are broken up and displaced under the 
perturbation. We consider II first. 
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We first decompose the Hermitian form W into two parts: 
W,+V. To the first belong those portions of W in which 
a characteristic space ’, R”’, - - - of A intersects itself, and to 
V those in which two different characteristic spaces intersect. 
If the characteristic values of H have but finite multiplicity 
the problem of bringing W’, that part of W in which §’ intersects 
itself, into diagonal form deals only with the space R’ of a finite 
number of dimensions. If §’ is not simply a one-dimensional 
space, the resonance phenomena mentioned above will appear. 
The co-ordinate system, consisting of characteristic vectors of 
H, is now more precisely specified, for now W, also appears as 
a diagonal matrix; let E, be the characteristic values of the 
H + eW, = H, so obtained. The single term value E’ asso- 
ciated with R’ has in general been resolved into as many different 
characteristic values E, of Hy as there are dimensions in the 
sub-space ft’. 

The remainder V = ||v,,,|| of the matrix is such that v,,, == 0 
if the characteristic values —,, £, of H are equal. The in- 
finitesimal unitary rotation 


8x =e-Cx, C= [lemnll, 
of order € transforms H into H + 6H where 
6H = e(HC — CH) ~ e(HC — CH). 


On choosing this transformation in such a way that 6H = — eV, 
H = H, + €V goes over into Hy; this can be accomplished by 
choosing c,,, = Oif £,, = E, and 


Umn 


Cmn E,, Fae i 
otherwise. The characteristic values E, of Hy are therefore the 
energy levels of the perturbed system of energy H if we neglect terms 
of order €?. 

W, can be considered as the time mean of the perturbation 
terms, averaged over the motion of the unperturbed system. 
For by (8.7) the mean value of the clement a,,,(t) of the matrix 
A(t), which represents an arbitrary physical quantity of the 
system, 1S @,, or 0, according as v, =v, or not. In statistics 
angular brackets are often used to denote the mean value of 
a quantity ; we may therefore write 


W = <W>, Hy = <H>. 


The solution of II naturally provides an answer to the 
question I. But it 1s more convenient to employ the method of 
variation of constants for the calculation of the effect of the 
perturbation over a limited time interval—the smaller the 


THE PROBLIM OF SEVERAL BODIES 89 


constant e, the longer we may take this time interval to be. 
Assume that at time ¢ = 0 the system 1s in the quantum state 
0 and that the perturbation begins to act at this time; we ask 
for the probability that the system will be found in the state 
nm at time ¢. That is, we seck that solution of the equations 


Oe an ty EE Wa im (= 0, 1,207 °) 


— 
. 


oj pam 


which reduces to 
Lee Ay = X_ = ' ag -.: Q 
at time ?= 0. Writing 


Me MO yal) 
the equations for €, are 
Ll; E 
ee ee ey Sia: eil'n — Ym)! - 
1 E h ee a ) 
for ¢ == 0, €, = 0. Neglecting terms of order e?, we can take 
the initial conditions 


&o =], f= fot eee 0 
as the 0" approximation; on substituting these values in the 
equation we obtain as the first approximation 


(v, >t: Vo). 

On setting vg — v, = v, the desired probability is 
»/&\? 1 -- cos (vt 

eat = [Eal® = 2G) | al 


= ott 608 tH (9.1) 


g,-2 geet tt 


It 1s to be noted that in accordance with this result the probability 
of transition from state 0 to state n is determined by |Ho,|?. In 


the case of resonance (vy, = vy) the transition probability in- 
creases at first with the square of ¢: 


eal? = (2) + Wool? 


§ 10. The Problem of Several Bodies. Product Space 


A physical system consisting of two particles of masses m, m’, 
co-ordinates xyz; x‘ y's’ and linear momenta p, p’, has as 
its Hamiltonian function 


_ i! 2 2 I ‘ : ‘ 
He (pet Py + Pi) + 5 | op, ie) 


m 
+ V(xyea; x'y's’), (10.1) 
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ere V is the potential energy. We assume, as in the older 
ysics of central forces, that we are here dealing with an action 
a distance so that the potential energy depends only on the 
1ultaneous positions of the two particles. This assumption 
turally breaks down when, in accordance with the theory of 
ativity, we take into account the finite velocity of propaga- 
n of the disturbance, which requires the introduction of a 
d. The wave function # of the system will depend on all 
co-ordinates xyz; x’y’2’ in addition to t; the operators 
‘responding to these functions in the domain of such functions 


are multiplication by x, +*+; x’, +++, and to the linear 
ee h 9 h 9- 
menta correspond the derivatives = —, a as 
1 ox 1 ox 


om (10.1) we then a the wave equation 


ee a = Ap — Vv b= 0. (10.2) 
> must ask for the ails that the one particle is to be found 
a point P and, simultaneously, the other 1s to be found ata 
int P’. The probability density is accordingly to be computed 

a 6-dimensional space with co-ordinates xyz; x ys 
Jeed, the wave field is not to represent directly occurrences 
cng place in physical space, but 1s to determine the appear- 
ce at definite positions or with definite energies and momenta ; 
sre 18 consequently nothing absurd in the fact that its medium 
this abstract 6-dimensional configuration space. 

In order to be independent of the special procedure by which 
> scalar wave mechanics puts together two systems a, BD to 
m a single system ¢, as suggested by this example involving 
> Hermitian forms representing the co-ordinates and momenta 
the two systems, we must first discuss the multiplication of 
wces from a purely mathematical standpoint. 

With each vector r = (x,) in a space # of m dimensions and 
>h vector ) = (y,) in a space © of » dimensions there is 
sociated a vector 3 =r X YH with components 


Sik = XV k (10.3) 


an m- n-dimensional space T = R X ©, the product space. 
e components are here numbered by means of the index 
r (tk) =I. The totality of vectors 3 =x x y do not them- 
ves constitute a linear manifold, but their linear combinations 
the entire product space {. With the linear correspondences 
in and Bin ©: 


/ ’ 
Xe = Da, Xi, Vie = OOK Ve 
i k 
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is associated a linear correspondence C = A X Bin: 
Xi Vy = 245% Dir Xi Vey 
or ; 
Zp = crit Cy. = ay by, = [L = (tk), UU = (0'k')]. 


Naturally, to this multiplication corresponds the law of com- 
position 


(A x B)(A, x B,) = (AA, X BB), 


where A, A, are correspondences of Rt on itself and B, B, are 
correspondences in ©. A co-ordinate system in §R and one in 
© together determine a co-ordinate system in ZY; if the co- 
ordinate system in is subjected to the transformation A and 
that in © to the transformation B, then the co-ordinate system 
associated with them in T undergoes the transformation A x B. 
In accordance with the equation 


Axe) ~= UX, Ve + 1° de, 


to the infinitesimal correspondence H in ®, ¥ in © corresponds 
the infinitesimal correspondence 


(H x 14) + (Le Xx F) (10.4) 


in J, where 1,, 1, denote the unit matrices in R, G, respectively. 
All of the foregoing 1s applicable to arbitrary vector spaces. 
When ‘Rk and © are both unitary spaces, then & ts also, for by 
(10.3) 

ssp LEK LUV 
is an invariant if LZ,x,, Ly,v, are; A xX Bis unitary if A and 
B are. 

Accordingly, two physical systems @ and 8 are compounded 
to form a total system ¢ as follows. The system space & of ¢ 
is R X GS, where ¥ is the system space of a and © of b. Let 
the arbitrary physical quantity « in ® be represented by 
the Hermitian form 4A; on replacing all these forms A by 
A X 1,, where 1, is the unit form in an arbitrary space ©, there 
exist between these latter exactly the same relations as between 
the A, so that from the solution of a quantum problem in ® 
there arises a solution for the corresponding problem in R® x 6, 
but there exists no real distinction between the two. In the 
system ¢ obtained by composition we have therefore to as- 
sociate the Hermitian form A xX 1, with a quantity « of a and 
1. x B with B of 6, where A, B are the forms associated with 
a, Bin R, G, respectively. The totality of quantities of the 
composite system ¢ is obtained by starting from the quantities 
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belonging to the component systems a and Bb and multiplying 
and adding them together in all possible ways. The quantities 
a of a commute with the quantities 6 of b, for 


(A x 1,)(1, x B) = Ax B= (1, x BYA x 1,). 


We refer to the content of these last two sentences when we 
say that ¢ consists of two kinematically independent parts a and b. 

The two systems are dynamically independent if the energy 
H of the composite system is the sum of the energies H), H(*) 
of the partial systems : 


H=(HM x I+ (1 x He), 


Siti cee wud : at 
The infinitesimal unitary correspondence =--:H in the total 


th 
system space is then that one which is due to the infinitesimal 
di at . des 
unitary correspondences preci yA a” in the two original 


system spaces [(10.4)]. If H@) and H') are both in diagonal 
form, then H is also, and the characteristic numbers are given 
by 

E, = EM + EP or == of +o [= (ib)] 


If we have a pure state for the total system which 1s repre- 
sented by the vector c of absolute value 1 and components 
Cx, and if QO = 1 943° | is an arbitrary quantity in @, then the ex- 
pectation of Q in the pure state ¢ is 


Q> = Lien linen’. 
This has the form (7.2) with 


A= |lau|| = ||Zéaeall 
A(r) is the Hermitian form 
3 eins? 


in Rt. But we see from this that we are not dealing with a pure 
state in @, for a;, will not in general have the form a,é,. Con- 
ditions which insure a maximum of homogeneity within ¢ need 
not require a maximum in this respect within the partial system Q. 
Furthermore: if the state of @ and the state of 6 are known, the 
state of ¢ 1s in general not uniquely specified, for a positive definite 
Hermitian form ||a,;, .|| in the product space, which describes 
a statistical aggregate of states c, is not uniquely determined by 
the Hermitian forms 


Dix, t’ky D ik, tk’ 
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to which it gives rise in the spaces R, ©. In this significant 
sense quantum theory subscribes to the view that ‘ the whole 
1s greater than the sum of its parts,’ which has recently been 
raised to the status of a philosophical creed by the Vitalists 
and the Gestalt Psychologists. 

The kinematically independent parts into which a system 
can be resolved need not be spatially separated, nor need they 
even refer to different particles. We can, for example, resolve 
a single particle, whose physical quantities can all be expressed 
in terms of x, y, 2; pz, Py, Pz, into three partial systems with 
fundamental quantities x, p,. | y, Py|s, pz For quantities 
which belong to different partial systems, for example a quantity 
which can be expressed in terms of x, p, alone and one which 
is in terms of y, py alone, commute with each other in the sense 
of matrix multiplication. 

In the perturbation theory we are usually concerned with a 
system which consists of two kinematically independent parts 
and which are almost dynamically independent. Disregarding 
the interaction eW for the moment, let hv, and hp, be the energy 
levels of the two parts, so that h(v, + p,) are the energy levels 
of the unperturbed total system. On writing in equation (9.1) 
$= (m, r) in place of 0 and s’ = (n', r') in place of 2, whence 


Kore (Vv, ais Pr) (Vy Ie Pr’) = Van + Per’; 


Van’ = Vn Vat) Pret = Pr — Pr, 


we find as the probability that the total system goes over from 
the state s to the state s’ during time ¢: 


€\? 1 — cos (v, 4) rye _ 
2) ems Lams da | Winr, n’r’) 2. (10.55) 
h (Van? =f Pre’) | 
The probability that the first system will be found in the state 
n’ after time ¢, the total system having been in the state s = (v7) 


originally, is obtained from (10.5) by summation with respect 
tor’. 


§ 11. Commutation Rules. Canonical Transformations 


The development of wave mechanics in §§ 1-3 went beyond 
the general scheme of §§ 7 and 8 in that it employed certain 
specific Hermitian forms to represent the co-ordinates and 
momenta of the particle. We are now interested in secing how 
this can be formulated in an invariant manner, without recourse 
to any special co-ordinate system in system space. 

For the Hermitian forms q, p representing a rectangular 
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co-ordinate and its associated momentum we postulate the 
commutation rule 


h 
Pg — 4p = =I. (11.1) 


If the system has only one degree of freedom, these two quantities 
appear as canonical variables in classical mechanics. All physical 
quantities of the system are then functions of p and g; in order 
to avoid complications we restrict ourselves to polynomials f in 
p and gq, and assume, in particular, that the Hamiltonian function 
H has this form. What are we to understand by the derivatives 
f, and f, of f with respect to p and gq in this domain in which 
p and g are not commutative in multiplication ? We should 
in any case require that differentiation with respect to g should 
obey the following postulates : 


(1) pp = 9, q = 1; : 

(2) (f+ ee=f +g, and (af), = a+ f,, where # 1s anumber; 

(3) (fela=h eth & 
We see immediately that these conditions uniquely determine 
the derivative of a polynomial f, unless they happen to lead to 
contradictions. But that they do zot lead to contradictions 
can be seen from the fact that they are obeyed by the definition 


ih f, = fo — pf (11.2) 
(1) follows immediately from the commutation rule (11.1), and 
the linearity (2) of the process ts evident. (3) 1s proved by the 
formula 


(fg)p — plfe) = flep — pe) + (fp — pfg 


which involves only the distributive and associative character 
of matrix multiplication. Similarly we can show that 


— ih+ fy = fq — af (11.2) 
The fundamental dynamical law gives us the equation (8.6) : 
hdf 
a Hf — fH 


for any Hermitian form f. On applying this equation to p and q 
—which obviously suffices to establish the corresponding result 
for any polynomial f of p and q—and comparing it with the 
formule (11.2) applied to the particular function H, we are led 
to the familiar Hamiltonian equations of classical mechanics : 


dq _ dp 
a= Ap =~ He (11.3) 
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It 1s a universal trait of quantum theory to retain all the relations 
of classical physics ; but whereas the latter interpreted these re- 
lations as conditions to which the values of physical quantities were 
subject 1n all individual cases, the former interprets them as con- 
ditions on the quantities themselves, or rather on the Hermitian 
matrices which represent them. This is the more significant 
formulation which the new quantum theory has given Bohr’s 
correspondence principle. 

The commutation rule (11.1) is of a rather remarkable 
nature. It is entirely impossible for matrices in a space of a 
finite number of dimensions, and it alone precludes the possi- 
bility that in an co-dimensional space q (or p) have only a discrete 
spectrum of characteristic numbers. For on referring gq to its 
principal axes 


g=||@mnll, Yun == 4m Imn =O (m= 2); P= ||Pmall, 


the left side of the commutation rule has the components 
Pimn({n — Ym); hence the main diagonal consists of nothing 
but zeros! The question arises as to whether it can be con- 
cluded from (11.1) alone that the forms representing q and p 
can always be given the form 


i D(x) Y(x) de, fae AO ax 


for an arbitrary vector yp with components (x) on employing 
an appropriate co-ordinate system in system space. We shall 
see in Chap. IV, § 15, that, on introducing a certain irreducibility 
condition, this is in fact the case. 

On taking into account the three space co-ordinates g, and 
their associated linear momenta p, (a = 1, 2, 3), we have in 
place of the one commutation rule (11.1) the following : 3 


PoPp — Paps = 0, Qa4p — 9p9a = 0 for all a, B | 
- h = ] (a oem B) (11.4) 
Pada — IpPa = 9s8, Ong ={o (a -+ B). 


The same commutation rules apply to the case in which we have 
several particles, the only difference being that then « runs 
through 6, 9, + - + values, according to the number of particles, 
instead of 3. These commutation rules are the necessary and 
sufficient condition that the dynamical law, which governs the 
time rate of change of the state vector ¢ in system space, leads 
to the Hamiltonian equations for the ‘ canonical variables ”’ 
da, Pa tepresenting the co-ordinates and associated momenta of 
the various particles composing the physical system—whatever 
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the dependence of the Hamiltonian function H on these quantities 
may be. 

In classical mechanics the Hamiltonian equations are invariant 
with respect to canonical transformations.** In a system of 
f degrees of freedom the transition from a set of variables 9,, pa, 
describing the state to a set gi, p, (a=1, 2,-°:, fpisa 


canonical transformation if the difference 


LP dd =, DP ta (11.5) 


is a total differential. If, for example, the g, are subjected to 
a transformation 


ge = $.(91 °° * 9) 


among themselves, the p, must transform as the components 


of a ‘ covariant vector’’ in g-space in order that the whole be 
a canonical transformation (‘‘ extended point transformation ’’) : 
, Ip 
Pa ae 4 2g, Pp. 


Perhaps the simplest canonical transformation is that in which 
the roles of g and p are interchanged : 


?x= — as qx = Pw 

The canonical transformations constitute a group [cf. III, § 1}. For 
the identity, i.e. the transition from (p, g) to (p, q), is a canonical 
transformation; the inverse (p’, 9’) —>(p, q) of a canonical 
transformation (p, g) > (p’, 9’) is also canonical; and from the 
canonical transformations (p, q) > (p’, q'), (p', g') > (p”, ¢”) 
it follows that the resultant transformation (p, g) > (p”, q’’) 
is also canonical, for if 


DP 84, = » PAGa; » Pa44y — P84 


are total differentials their sum 


aP.4g, — SPA4, 
is also. 


An infinitesimal canonical transformation is one in which 
p’, q’ differ infinitely little from p, g. We can consider it as 
an infinitesimal deformation of the 2f-dimensional (p, g)-space 
which takes place in the infinitesimal time interval ¢ = 8¢. We 
introduce the components 6p, 8q of the displacement vector by 
means of the equations 


Pu — Pa = &* 8p,, Qu — Ia = &* 8a * 
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Since (11.5) must be a total differential, 
Pan + 24x dp, = aT (11.6) 


must also; in our case J must differ only infinitesimally from 
Pa We may therefore write 


T= DPada — eos 


considering S as a function of p, and g’, we have, in accordance 
with (11.6), 


Ss as 
Pa = Px aq) Ga qa Px 
or 
oS oS 
ea, Oe | 
p. ve 85 os (11.7) 


Since we may Icgitimately neglect terms of order ¢#, we may 
identify gq, with g, on the right-hand side of these equations. 
We call S the generating function of the infinitesimal canonical 
transformation. 

In accordance with the Hamiltonian equations, the state 
of a system, represented by a point (p, q) in (p, qg)-space, goes 
over into a state (p + dp, q+ dq) during time dt. If we follow 
this transition for all possible initial states (p, q) we obtain an 
infinitesimal deformation of the space whose points represent 
the state of the system. The Hamiltonian equations assert that 
this deformation 1s an infinitesimal canonical transformation with 
generating function Fl- dt. It follows from this without any 
calculation that these equations have a significance which 1s 
independent of any particular choice of canonical variables. 

Now in quantum theory the Hamiltonian equations (11.3) 
assert that the state vector x in system space undergoes the 
infinitesimal unitary rotation 

at 
dy == — <= + Hr, (8.1) 
so the infinitesimal canonical transformation of the quantities 
p, q is here obtained by subjecting the argument x in the Her- 
mitian forms representing them to the infinitesimal rotation 


e. oy = — *. - Sr. 
We find that the increments of the quantities p,, q, are in fact 
1€ 
€ . ga = =(Squ — quS),  € » BPa = 5¢(SPa — Pad); 
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and, in virtue of the commutation relations (11.4), this agrees 
exactly with (11.7). On generating a finite canonical trans- 
formation by the successive application of an infinity of in- 
finitesimal ones we arrive at the result that the unitary corre- 
spondences of system space on itself in quantum theory : 


t= UE 


correspond to the canonical transformations of classical mechanics ; 
more precisely, only those for which the matrix U 1s expressible 
in terms of the matrices p, q, but we may for the present pass 
over the question as to whether every matrix U can be obtained, 
or at Icast arbitrarily closely approximated, in this way. Since 
the commutation rules (11.4) remain unchanged under rotations 
of the normal co-ordinate system, they are valid for an arbitrary 
set of canonical variables. This is also evident from the fact 
that they are the conditions that the dynamical law (8.1) lead 
to the Hamiltonian equations 
“4a = zis a ues (11.3) 
dt wp, dt dda 
The general procedure for the quantum mechanical treat- 
ment of a physical system suffers from the disagreeable fact 
that the expression for the energy in terms of the canonical 
variables must be taken from the classical model, and in ad- 
dition the transition to quantum mechanics is even then not 
unique, for the model offers no means of telling whether a 
monomial such as p*q¢ is to be interpreted as pg, pgp, qp* or 
a linear combination of all three [cf. IV, § 14]. The provisional 
character of such a procedure 1s clear, but the results so far ob- 
tained seem to justify the hope that the path we have entered 
upon will lead to a unique formulation of the laws governing 
the actual physical phenomena. We need then concern our- 
selves longer with the general mechanical scheme. 


§ 12. Motion of a Particle in an Electro-magnetic 
Field. Zeeman Effect and Stark Effect 


Let the spatial co-ordinates x yz now be denoted by %, x2 x, 
andthetimetby x». If is the scalar and c YW the vector potential 
of the electro-magnetic field, then in the theory of relativity 


= $, 2, YA, 2.) ae (Po, 1, Po, $s) 


are the components of a vector in the space dual to the 4-di- 
mensional world. Let 
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Fy, fa, 39 are the components of the electric field strength &, 
c(Fo3, F3,, fy.) the components of the magnetic field strength §. 
Denoting the components of the velocity of a particle by 
V1, Us, U3, 1tS proper time 1s 


ds = V dt? — (dx? + dx? + dx?) /c? 
=: dtV1 — v2/c?  (v® = v? + v2 + v2). 


AX ; : 
With the world vector u* = 7, ‘8 associated the dual u, with 
components 2 

Use ul (Fs), 2.3). te cP a. 


The invariant equations of motion for a particle of mass m and 
charge — e are 


d(mu,) _ : P 
a ee rae u 
or 
| 3 
d(mu;) e( Fe + x Fuve) (i= 1, 2,3). (12.1) 
at k=1 


The right-hand side is in fact the ponderomotive force 
l 
— e(E + =[06}). 


These equations arise from the Hamiltonian function 


eo eddy -f- C mic? + y (Pp, ae ep,)?, (12.2) 
t=) 


in which x,%2%3; ~P,P2p3 are the canonical variables. In fact, 
the Hamiltonian equations 


dx; oH _ clp, + e¢,) 
ne at (OP, / 
yield 
pi + ed; = mu, ; 
in the remaining equations 
dp, 0H _ Igo 3 Id, Pet by} 
dt (x, e| ne 
the left-hand side 1s 
d 1 0 i : ry) i 
(m2;) (9; mn og: oe). 


~ dt meaty a bn OX, 


But this is the desired equation (12.1) : 
: | 
d(mu,) ee ef (SE ae =) the IS &4 sé Fo}, 
k=1 


dt 0X; dX 0X; = Xx 
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The negative energy — H is the time component py of the dual 
vector whose space components are the components of linear 


momentum ) = (fj, Po, Ps), So the equation (12.2) can be written 
in the rational form 


] 3 
~alPo =f egg)? ~ eS (p; + ed;)* == m%c?, 


From this we obtain the simple rule: The influence of an electro- 

magnetic field on a particle of charge — e can be expressed by re- 

placing px by Pa + eh, 1m the equations of motion for a free particle. 
On going over to quantum theory p, becomes the operator 

ze and 1s contragradient to the 4-dimensional displacement 
a 

dx,, as is seen from the equation 


dae) 


Our rule is now: On introducing a field of potential ¢, 


dy 
x,t 


d ”) 1€ 
oy, must be replaced by ox, a 7, Po (12.3) 
in the wave equation of the particle. Only Jy has asimple physical 
significance ; it 1s therefore to be assumed that the laws which 
govern yw remain invariant on replacing p by e+, where A is 
any real function of position in space-time. On the other hand, 
in the classical theory of the electro-magnetic field only the 
field strengths, and not the potentials, have an objective signifi- 


; ; P) 
cance, i.e. the laws are invariant on replacing ¢, by ¢, — an 


where p is also an arbitrary function of the x, On examining 
our wave equation for these invariantive properties we find 
that it is not invariant under each of them separately, but that 
there must exist a certain relation between A and p. The field 
equations for the potentials b and @ of the material and electro- 
magnetic waves are invariant under the simultaneous replacement 
of 
ia. h da | 
PP EO BNE. Pal ee 
here X is an arbitrary function of the space-time co-ordinates. 
This “ principle of gauge invariance ’’ is quite analogous to that 
previously set up by the author, on speculative grounds, in 
order to arrive at a unified theory of gravitation and electricity.”° 
But I now believe that this gauge invariance does not tie to- 
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gether electricity and gravitation, but rather electricity and 
matter in the manner described above. We shall discuss this 
principle more thoroughly in Chap. IV; its significance and 
its interpretation will then be more apparent. 

On passing to the limit ¢— oo in (12.2), after separating 
out the factor mc?, we return to ordinary mechanics : 


H = eb) + ext + eg;)?. 


On neglecting terms which are quadratic in the ¢,, we find, in 
addition to the kinetic energy )’p?/2m, the potential 
i 


oer ee - (pl). (12.4) 


We have already made use of the first part, that due to the 
electric field, in§ 5. If we have, in addition to the field originat- 
ing in the nucleus, a homogeneous electro-static field in the 


direction of the z-axis and of strength /*, for which d = — F <a, 
it adds the perturbation term 
W= el’ +2 


to the energy. A homogeneous static magnetic field § 1s 
, 1 
obtained from the vector potential ¢ = 5 [Sor], t= (x, y, 2); 


this adds to the energy the perturbation term 


e . e 
nr (p(Syr]) = er ({rp]§), 
1.e, 
eho. 


a tn etn tt teen 


Zeeman Effect.-—If the homogeneous magnetic field strength, 
of magnitude |{9|, is in the direction of the z-axis, the per- 
turbation term 1s 


W=ho-L, o= el) (12.6) 
2pLc 
On choosing the characteristic functions ~¢" as our co-ordinate 
system in the system space of the functions yf, W, as well as 
the energy of the unperturbed atom, is in diagonal form; in 
the state defined by ml, m it has the value 


ho+m., (12.7) 
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The components (nl, m) — (n’'l’, m’), consistent with the selection 
rule for m, into which the line with frequency v = 7 (En — By) 
is broken up give rise to but three lines: one corresponding to 
all the transitions m-—> m, which is linearly polarized in the 
direction of the z-axis and is undisplaced ; one which is circularly 
polarized perpendicular to the z-axis, the frequency v of which 
is displaced by + 0 (m->m — 1); and one which its circularly 
polarized in the opposite sense, with frequency v — o instead 
of vy (m>m-+1). This normal Zeeman effect is found only 
in the so-called singlet lines. 

Stark Effect—In accordance with the general perturbation 
theory, the displacement and resolution of terms in the presence 
of a homogeneous electric field is determined, to terms of first 
order, by the matrix 


e+ <2, 


In consequence of the selection rule />/-+ 1, <z> = 0, unless 
accidentally all energy levels whose azimuthal quantum numbers 
differ by 1 coincide. Ignoring this exceptional case, we should 
expect to find no 1* order perturbation effect increasing linearly 
with the field strength F (linear Stark effect), but only a quadratic 
effect, which is much smaller. This is in agreement with the 
experimental data on alkali atoms. Hydrogen is, however, 
degenerate, since for it energy levels with the same principal 
quantum number m and /=0, 1, +++, »—1 coincide. The 
calculations for this case have been carried out by Schrodinger 
and compared with experiment.”® 


§ 18. Atom in Interaction with Radiation 


Following Feans, black body radiation is mathematically 
equivalent to a system of infinitely many oscillators. Maxwell’s 
equations for the free ether are 


Lr _ 


div = 0, curl € + ~ = 0; 
1 
div © = 0, curl ) — = 57 


In order to simplify the relations, we assume that the walls of 
the radiation cavity of volume V are reflecting; then ©€ is 
perpendicular to the walls at the boundaries of the cavity. 
Since the black body is at rest it is of no particular advantage 
to carry through the calculation in a relativistically invariant 
manner; we may therefore normalize the vector potential 
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eM in such a way that the scalar potential vanishes. We then 


have € = — 7 and the equations in the first row are satisfied 
by © =c-curl Y%; the equations in the second row become 
. 1 dy 


On the boundary % is normal to the walls. Let the characteristic 
numbers and characteristic functions of the equations 


ax + 4% = 0, div YU = 0, 


with the boundary condition that Mf is there normal, be denoted 
by 
“v= Py (2250),. Wes tees ly Bod ee 


normalized in accordance with 


{(UUp)dV = 43,5. 
V 
On setting 


VD gt Uy, 


Where the coefficients g% depend on time but not on position, we 
find for them the equations 
a*g* 
2 ae 
at? a Px q = 0. 


. _ dq* a a 
Introducing “T= p in addition to the g*, this equation 1s 
that for an oscillator with Hamiltonian function 


Dietec oe Wha 
H. = 9(P*)* + 5palg*)?s 
we readily find on applying 
C= — PY pA, = ¢ Ji q*: curl U, 
that the energy of the radiation field 1s in fact given by 


H= g-[ (8+ 9)dV = SMa; 
Sar os 
4 


with this we have proved the theorem due to Feans. For high 
frequencies p there are approximately 


V p*dp 


n¢3 


(13.1) 
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modes of oscillation in the frequency interval p, p + dp.27. We 
are interested above all in the limiting case of an infinitely large 
cavity ; the spectrum then becomes continuous and our formula 
for the density of frequencies becomes exact. 

On quantizing this mechanical system of infinitely many 
oscillators 2° in accordance with the theory of the oscillator (§ 3) 
and the process of composition (§ 10—but cf. remark on p. 109), 
we find as possible quantum states s, each of which is characterized 
by the fact that in it there is associated with each index a an 
integer n, 2 0. In this quantum state 


- Hy = hpa( na + ) 


or, on choosing the additive constant in the energy in such a 
way that the lowest energy value which the black body radiation 
is capable of assuming 1s 0, 


Ho Ne Wp. 1 SD Net ps 
In the language of photons this means that when the cavity 


is in the state s it contains 2, photons of each kind « The 
matrix element 


Ge A Ni Wag Og OOS Sy pte ye gg 
vanishes unless all the equations 
Ny = My, Ny = Ng, Ng=Mg,** 
hold with the exception of n, = 1,, which is to be replaced by 
n =n, +1 or ni=n,—1. 


In the first case we have, by eq. (8.12), 


Vee = Ps (Emission), (13.2) 
Pa 
and in the second 
Ci = —s (Absorption). (13-2) 
Pa 


The first transition s > s’ consists in a photon of kind « springing 
into being, the second in the disappearance of one such photon, 
It follows from the above that in a transition for which q%,, + 0 
all other g®, must vanish. 

Let an atom with fixed nucleus and electric dipole moment q 
interact with the radiation field. Differentiate the quantum 
states of the atom from one another by means of the index n 
and denote the corresponding energics by hv, ; then q = ||Qnn’||. 
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A quantum state of the total system consisting of both atom 
and radiation is characterized by the quantum numbers 


mM, My, Me,* * *, May? * ° 
The effect of the radiation on the atom its, in accordance with 
eq. (12.4) of the preceding paragraph, given to a first approxima- 
tion by the perturbation term 
eW = (qX2). 


It can be shown that the addition of such a term to the 
Hamiltonian function of the total system will, according to 
classical theory, not only indicate an influence exerted on the 
atom by the radiation field, but will also modify the equations 
of Maxwell in a way which indicates that the motion of the 
electrons in the atom affects the radiation field. The per- 
turbation term will accordingly call forth emission as well as 
absorption. To a sufficient approximation we may take for Y 
its value at the point occupied by the nucleus, provided we restrict 
ourselves to radiation whose wave-length 1s large compared with 
the dimensions of the atom. We now have 


eW = »(g U,)Q*. (13.3) 


From this it follows than an element €- W,,, 4-5 can only differ 
from 0 if s and s’ are such that all 2, = ng with the exception 
of a single one 2,, which must equal », +1. Then only the 
a'" term contributes to the sum (13.3), and we have 


EW ns. nigh (qa Y,) ° Gai (13.4) 


Bohr’s frequency condition, which asserts that the emission or 
absorption of a photon in state « with energy hp, 1s associated 
with a quantum jump of the atom in which an amount 
+ h(vy — vy) = hp, of energy is lost or won, need by no means 
be satisfied here. The finite cavity has its own frequencies p,, 
and may therefore be in no position to take up the frequencies 
associated with the quantum jumps of the atom. This is true 
in principle, but as a matter of fact, as we shall see, Bohr’s 
frequency condition 1s fulfilled to a very close approximation in 
the overwhelming majority of all transitions ; and this is more 
and more the case the larger the cavity 1s. 

Let the atom be in the state » and the radiation in the 
state s = {n,}. We set 


LA Ng pa = V+ U(p)dp, (13.5) 
where the sum on the left is to be extended over those indices « 


for which p, lies between p and p+ dp; hence U/(p)dp is the 
energy density of the radiation contained in the frequency 
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range p, p+ dp. In accordance with (10.5), the probability 
that the atom will find itself in the state n’ after time ¢ is given 
by 

2 1-08 (Yan + paa')l 

hs (Van! ie Psa)” 
The contribution to this sum due to the cases in which a photon 
is emitted is, in accordance with equations (13.2), (13.4), given 
by 


-1EWas, nt |? 13.6 
(13.6) 


2 1 — cos (Van’ _ Px)t . h(n, ate 1) 2 . 
aa en rear sea Fs | (Gun Wa) |?,  (13°6.) 
and that for absorption by 


2 yl = 608 (Yan + pall hte | (5 ; 
h? 2 (Vnn’ te Px)* 2px (Gna 2a) (15.64) 


Consider first the case in which the term level vp, is higher than 
Vani Van' = ¥n— Yn = — v is then negative. We now collect 
together all those terms « in the sum (13-6,) for which p, lites 
between p and p+ dp. Since the position of the atom 1s not 
exactly fixed—even in consequence of the variations caused by 
the emission of photons—we may, for small wave-lengths, 
replace 2? by its mean value 47/V as given by the normalizing 


equation )w2dV = 4a, and we may also assume that all 


directions are equally probable for Y,. The square |(%,q)|? of 
the scalar product of 2% with a fixed vector q has then the mean 
value a -|q|%. (13.6,) then becomes 
1 — cos (p -- v)t 4a Idan'l? St Ne Pa 
(p — v)? 3. ”V 2p 
On introducing (13.5) the sum (13.6,) may, to a good approxima- 
tion, be replaced by the integral 


eae (i Se, See 
7 ( — ve pe 
Essentially the only elements which contribute to the value of 


this integral, for a time ¢ large in comparison with the duration 1/v 
of an oscillation, are those for whichplies near tov. On developing 


U(p) __ Ulv) 
p? yy 
in powers of p — », the first term in the expansion contributes 
+ 0 
U(v) 1 — cos x Uv) 
y2 tf de = ot (13.7) 


— 0 
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to the integral; all others are to be neglected. Similarly the 
entire amount (13.6,) due to emission is negligible, for its de- 
nominator (p+ v)? vanishes nowhere. This means that the 
transition 1s almost invariably associated with the absorption of 
a photon whose frequency lies very close to v. The probability 
that the atom will appear in the higher state n’ after lapse of 
time ¢ increases in proportion with ¢; the factor 


4m? U(v) Aa? 
3 (hv)? ; Gan’ |? sie Bh2 : U(r) |Gnn’|? 


is the probability that the transition n —> n' take place in unit time. 

This formula was obtained for the case in which the state 
n' possessed a higher energy level than . In the reverse case 
only the sum (13.6,) due to emissions contributes an appreciable 
amount. We now put vay = V_ — Vy = v and obtain the same 
formula with this difference: in place of 7, we now have 2, + 1, 
or in place of the sum (13.5) the sum 


Shing + lpr = Lhnsp.r + Lpa. 


The first is + U(p)dp, and we denote the second by V « u(p)dp. 
This latter is equal to (hp) times the number of modes of vibra- 


tion of the cavity within the frequency interval p, p + dp; hence 
by (13.1) 


hp*dp 


\ 
Loy = 2A. 
rics? (v) r*c3 


l’ + ulp)dp = V - 


The probability that the atom drop from state n into the lower state 
nin unit time is given by 


Agr 


52 [U(v) + 1(¥)}|qan'|?- 


The additional term wu(v) is characteristic for spontaneous 
emission. When the radiation is not enclosed in a_ black 
body, i.e. when there is no radiation density U(v), the proba- 
bility that the atom drop from the state n to the lower stale n’ 
in unit time, emitting thereby a pnoron whose frequency hes 
in the immediate neighbourhood of vy = vy, - vy, 18 


Ap 
a Gan’. 


This agrees with the formula obtained by integrating (8.11) 
over all directions. The probability that the atom jump from 
the level » into a higher level n’ (v, > v,) under these same 
conditions 1s zero. 


108 QUANTUM THEORY 


In the energy field of the black body radiation we find not 
only absorption, but also “' stimulated emission,” both of which 
are proportional to the energy density U(v). On setting 


47? 


Ann = 3h2 lGan’|? 


, (13.8) 


the probability for a jump from state » to a higher state n’ in 
unit time is 


(v = vy — va), (13.9) 


Onn! = Levi’ ° Uv) 


and the probability for the inverse jump, the drop from n’ 
to 7, Is 


Wain = Anra[U(v) + u(v)] . (13.9) 
Since Inn’ is an Hermitian matrix, 
Ane Agal (13.10) 


If there are a number of atoms in the radiation field and the 
whole system is in a steady state, then on the average as many 
atoms must make the jump »— »’ in unit time as make the 
inverse jump »’-—> x. On denoting the number of atoms in 
the state by N,, these considerations are expressed in the 
condition 

Ann ' N,U(v) = Ann’? N,[U(v) + u(v)] 
or 


pee eae (13.11) 


The probability coefficients Anja = Ay, have entirely dis- 
appeared—or rather, almost entirely, for the equation is valid 
only under the assumption that A,, +0 or Qyy +0, ie. 
the transition 22x’ is not to be forbidden by the selection 
rules. But for such a system in thermal equilibrium N, must, 
as shown by Boltzmann, be proportional to 

—~E /ke —s-_.._ avy, /kO 

eé a 0 == @ n ; 
where @ is the temperature and & the Boltzmann constant. 
Equation (13.11) then becomes 


eben! — vq iho u(v) 


Uv) 


or the Planck radiation formula : 
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this formula is valid for all frequencies v whose energies can be 
exchanged by the absorbing and emitting atoms in accordance 
with Bohr's frequency condition.?® 

We have thus finally returned to the historical origin of the 
quantum theory. We must now add three remarks concerning 
this treatment, due to Dirac, of energy exchange between matter 
and radiation. In the first place, it 1s able to explain the fact 
that the spectral lines are not sharp, but possess a natural breadth.®® 
Secondly, we must inquire what causes this difference between 
absorption and emission, processes which are transformed into 
each other on changing the direction of time. Indeed, the 
fundamental mechanical and field laws are invariant under the 
transformation t-> —t! The answer is that this difference 1s 
due to the preferential direction in time involved in the application 
of the theory of probability ; we assume a fixed initial state and 
calculate, with the aid of transition probabilities, the distribu- 
tion over the various states at a later time, not the distribution 
which would result from the equations for an earlier time. If 
no assumption is made concerning this preferential direction, 
t should be replaced by |¢| in (13-7). And finally, the fact that 
we have here treated Maxwell’s equations as classical equations 
of motion, and as such have subjected them to the process of 
quantization, may give rise to serious doubts—for in our general 
formulation Maxwell’s equations are already the quantum- 
theoretic wave equations for the photon! But we shall see 
in Chap. IV, § 11, that this method 1s in fact the correct one 
to employ in order to go from one corpuscle to an indefinite 
number of corpuscles. For since the number of photons must 
remain indefinite --as a photon can, in contrast to an electron, 
spring into being or disappear-—-the method of composition 
described in § 10 is not applicable to them. 


CHAPTER III 


GROUPS AND THEIR REPRESENTATIONS 


§ 1. Transformation Groups 


TT ore concept of a group, one of the oldest and most 


profound of mathematical concepts, was obtained by 
abstraction from that of a group of transformations. 

A point-field, a domain of elements which we call points, 
on which the transformations operate, underlies the trans- 
formations. This point-field may be either the totality of a 
finite number of individually exhibited elements or an infinite 
set, in particular a continuum such as space or time. A 
mapping or correspondence S of the point-field on itself 1s 
determined by a law which associates with each point p of the 
field a point p’ as image: p— p’ = Sp; two correspondences 
Sp and Tp are identical if for all points p the two image 
points Sp and Tp coincide. If the point-field contains a finite 
number of elements the correspondence S can be defined by 
giving explicitly the image point for each point p; for infinite 
sets, however, the association is only possible by giving the 
law of the function S. 

Among such correspondences there is a particular one which 
associates with each point p the point p itself: pp; it 1s 
called the identity J. Two correspondences can be applied 
successively : if the first sends the arbitrary point p into p’ == Sp, 
the second p’ into p” = Tp’, then the correspondence resulting 
from the composition of the two 1s defined by the association 
p> p” = T(Sp) and is denoted by TS (read from right to left !). 
The resultant correspondence depends on the order of the two 
factors S and JT. In order that composition be possible it 1s 
essential that the correspondences are ones which map the 
point-field on itself, and not on another point-feld. 

We shall restrict ourselves to one-to-one correspondences S: 
the image points p’ = Sp associated with p shall always be 
distinct, and each given point p’ shall appear as the image of 
one (and only one) of the points p. Consequently such a one-to- 

110 


TRANSFORMATION GROUPS 111 


one correspondence S: p-> p’ determines a second, the inverse 
S-!: p’—» p of S, which just cancels it : 
S'(Sp) = p, S(S'p') = p’ or 
SUS = 7, So) = 

The inverse of S~1 is again S and the identity / is its own inverse. 
The resultant 7S of two one-to-one correspondences S, T 1s 
itself one-to-one, and its inverse is (7S)7} = S177} —for 
on inverting the correspondences p-> p’-—> p” there results 
p'’ > p'-» p. Henceforth we shall consider only those corre- 
spondences, also called transformations or substituttons, which 
are one-to-one. In this domain we have, in accordance with 
what has been said, the two fundamental operations of inversion 
and composition. 

Eexamples.--1. Let the point-field consist of # elements 
exhibited individually ; bring them into a particular order by 
numbering them with the integers 


12s mn. (1.1) 


This numbering consists in a one-to-one reciprocal relation 
between the elements of the point-field and the integers or 
possible ‘* positions '’ g in the series (1.1). A permutation con- 
sists in the transition from one such arrangement to another. 
If we wish to operate in space we may think of the positions as 
fixed compartments into which the movable elements can be 
laid, or, conversely, we may think of the elements as fixed and 
shift the movable numbers about. With each permutation 1s 
associated a one-to-one correspondence p— p’ which tells 
which element p’ occupies, after the exchange, the position 
previously held by p. Insofar as the method of numbering ts 
considered as left to convention, the permutation is nothing 
more than this one-to-one correspondence. The concept is to 
be understood in this way when we are concerned with the 
composition or successive application of permutations. 

2. A kinematical example of a group 1s offered by the motions 
of a space-filling substance, in particular those of a rigid body. 
The positions or numbers of the preceding example are here 
represented by the material points and the point-field is the 
space itself. The one-to-one correspondence p — p’ connects 
the initial with the final state: that material point which origin- 
ally covered the spatial point p is taken to the point p’ by the 
motion. Congruent correspondences of space on to itself will 
also be briefly referred to as ‘‘ motions” in the geometrical 
sense. 

The concept of a group of transformations 1s now readily 
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formulated. We understand by it any system © of transforma- 
tions of a given point-field, which is closed in the sense of the 
following conditions : 

1. It contains the identity ; 

2. If S belongs to &, then its inverse S~! does also ; 

3. The resultant TS of any two transformations S, T of & 
is also a transformation of ©. 

As examples we name the group of all x! permutations of n 
things, the congruent mappings or ‘* motions ”’ of 3-dimensional 
Fuclidean space, all homogeneous linear transformations in 
n variables with non-vanishing determinants (affine correspond- 
ence of an n-dimensional vector space) and the group of unitary 
transformations in ” dimensions. 

If the point p goes over into p’ by means of a transformation 
of the group @, then p’ is said to be equivalent to p (with respect 
to the group ®&). The same concept is applied when we are 
considering instead of a point p a figure consisting of points. 
Expressed in these terms, the three requirements for a group 
are nothing other than the three axioms of equality : 

1. p is equivalent to p; 

2. If p’ is equivalent to p, then p is equivalent to p’ ; 

3. If p’ is equivalent to p and p” to p’, then p” is equivalent 
to p. 

According to Klein’s Erlanger Program # any geometry of 
a point-field is based on a particular transformation group & 
of the field; figures which are equivalent with respect to @, 
and which can therefore be carried into one another by a trans- 
formation of &, are to be considered as the same. In Euclidean 
geometry this role is played by the group of congruency trans- 
formations, consisting of the motions referred to above, and 
in affine geometry by the group of affine transformations, etc. 
The group expresses the specific isotropy or homogeneity of the 
space ; it consists of all one-to-one * isomorphic correspondences ”’ 
of the space on itself, i.e. those transformations which leave 
undisturbed all objective relations between points of the space 
which can be expressed geometrically. The symmetry of a 
particular figure in such a space is described by a sub-group of 
® consisting of all transformations of G which carry the figure 
over into itself. The art of ornamental tiling, which was per- 
fected by the Egyptians, contains implicitly considerable know- 
ledge of a group-theoretic nature; we here find, perhaps, the 
oldest fragment of mathematics in human culture. But only 
recently have we been able to formulate clearly the formal 
principles of this art; attempts in this direction were already 
made by Leonardo da Vinci, who sought to give a general and 


ABSTRACT GROUPS AND THEIR REALIZATION 113 


systematic account of the various types of symmetry possible 
in a building. But the most wonderful symmetrical structures 
are exhibited in crystals, the symmetry of which is described 
by those congruency transformations of Euclidean space which 
bring the atomic lattices of the crystal into coincidence with 
themselves. The most important application of group theory 
to natural science heretofore has been in this field. 

The following considerations fit naturally into the present 
discussion. Let the point-field M on which the transformations 
S of the group & operate, be mapped on the point-field N by 
means of the one-to-one correspondence A: p—+>q; the case 
in which the correspondence serves to introduce new numbering 
or new co-ordinates is of particular importance. Through this 
correspondence A of M on N the transformation S of M becomes 
a transformation T of N; in the particular case mentioned above 
T is simply a description of the transformation S in the new 
co-ordinates. It is evident that to the composition of trans- 
formations S corresponds the composition of the corresponding 
transformations 7 of N and that a group & of transformations S 
goes over into a group § of transformations 7. The relation 
between these two transformations is 


T = ASA“, (1.2) 


for if we denote the transformation S by p — p’ and if gq, q’ are 
the points of N associated with p, p’ by A, then the transforma- 
tion g — q’ of N 1s effected by 


q>p>p>d|. 


We may also write §) = AWA In particular, these considera- 
tions apply when N and M are the same point-field. 


§ 2. Abstract Groups and their Realization 


An arbitrary number of transformations of a given point-field 
on to itself can be applied successively ; we are of course not 
restricted to merely two. But when we perform this process 
step by step it is automatically reduced to a succession of com- 
positions of transformations taken two at a time: 


ABC ++ += A[B(C + +>). 


This possibility of performing an extended composition in steps 
involving but two transformations at a time shows that the 
associative law 

(AB\C = A(BC) 


holds for any three transformations A, B, C. 
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The structure of a transformation group 1s obtained from it 
by abstraction when we allow the transformations themselves 
to degenerate into elements of an immaterial nature, retaining 
only their individuality and the rules in accordance with which 
two given transformations are composed, in a given order, to 
form a third. In accordance with what has been said such 
composition necessarily obeys the associative law. Perhaps it 
also obeys other universal laws, but since we have at present 
no indication of this we attempt a formulation of the abstract 
structure of the group by means of the following definitions : 

An abstract group is a system of elements within which a law 
of composition is given such that by means of it there arises from 
any two (the same or different) elements a, b of the group, taken in 
this order, an element ba. The following conditions shall thereby 
be satisfied : 

1. The associative law c(ba) = (cb)a ; 

2. There shall exist an element 1, the unit element, which leaves 
an arbitrary element a unaltered on composition with it : 


la=al=a. 


3. To each element a shall exist an inverse a which yields on 
composition with rt the unit element I: 

aat+=-atla=I. 

Such an abstract group is not to be confused with its realt- 
zation by transformations, i.e. by one-to-one correspondences of 
a given point-field. A realization consists in associating with 
each element a of the abstract group a transformation T(a) of the 
point-field in such a way that to the composition of elements of 
the group corresponds composition of the associated transforma- 
tions : 


T(ba) = T(b)T(a). (2.1) 
[t follows from this that to the unit element I corresponds the 
identity / and to inverse clements a, a~! correspond inverse 
transformations : 

T(a“!) = T-\(a). (2.2) 
The first assertion follows from the particular case 

T(a)T(t) = T(a) 
of (2.1) by left-handed composition with the reciprocal of the 
transformation T(a); (2.2) is then contained in (2.1) as the 
particular case b= a™'. The realization is said to be faithful 
when to distinct elements of the group correspond distinct 
transformations : 


T(a) + T(b) when a + 0. 
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In accordance with the fundamental equation (2.1) the necessary 
and sufficient condition for ‘faithfulness’ 1s that T(a) shall be 
the identity only if a is the unit element. For if a, 6 are two 
elements of the group it then follows from T(a) = T(b), 1.e. 


T(a)T-(b) = T(a)T(b) = T(ab) = 1 


that under these conditions ab“! = J,i...a= 0. If the abstract 
group is obtained from a transformation group & by abstraction, 
then conversely @ is a faithful realization of it. 

In the study of transformation groups we always deal with 
two manifolds, the structureless point-field and the manifold of 
group elements, the structure of which 1s expressed by the law 
of composition. The original problem thus resolves itself into 
two; the examination of the various group structures possible 
and the examination of the possibility of obtaining realizations 
of the given abstract group by transformations of a given point- 
field. The historical development of the subject has shown that 
it is advantageous to effect this division into two problems ; 
they are of fundamentally different character and require 
fundamentally different mathematical equipment for their 
discussion. 

In accordance with our method of introducing the abstract 
eroup, which we henceforth refer to simply as the group, it 
serves merely to give the structure of the group; the nature of 
its elements is immaterial. This abstraction from the nature 
of the elements is expressed mathematically by the concept of 
isomorphism. If we have two groups g, g’ and there is as- 
sociated with each element a of g an element a’ of g’ in a one- 
to-one way: a =a’, such that 


(ba)’ = b’a’, (2.3) 


then the two groups are said to be simply isomorphic. Simply 
isomorphic abstract groups offer no means of distinguishing one 
from the other. The concept of isomorphism can, of course, be 
applied to transformation groups. Two isomorphic transforma- 
tion groups can be considered as faithful representations of 
one and the same abstract group. A group may be isomorphic 
with itself; it is then said to be automorphic. Such an auto- 
morphism occurs when g and g’ coincide, 1.e. when a one-to-one 
reciprocal association a@a’ satisfying the condition (2.3) is 
established between the elements of the group g. 

The question arises whether or not every abstract group 
possesses a faithful realization. If this were not the case the 
concept of an abstract group as developed above would be too 
broad—-there would exist, in addition to the associative law, 


) 


116 GROUPS AND THEIR REPRESENTATIONS 


other purely formal laws for the composition of transformations 
which are satisfied by every transformation group. Conversely, 
a proof of the realizability of any abstract group would tell us 
that all that can be said about the formal laws for the com- 
position of transformations 1s contained in our conditions (1) 
to (3). We can, in fact, construct a faithful realization of any 
abstract group g by taking as the point-field the group manifold 
itself and letting correspond to each element a of the group 
the transformation 
S—>s' = as 


of the group manifold on to itself. This ‘ left-translation ”’ 
t, is obviously a one-to-one reciprocal transformation which 
has as inverse the transformation s = as’. If a and 6 are 
distinct elements the corresponding transformations t,, t) are 
distinct, for they allow the unit element I to correspond to the 
distinct elements a, b respectively. If we perform in succession 
two left-translations 


s>s'=as, s’>s" =b)s' 


the resulting transformation 1s, in consequence of the associative 
law, 
s -> Ss’ = b(as) = (ba)s. 


Consequently the left-translations constitute in fact a faithful 
realization of the abstract group. However, the right-trans- 
lations behave otherwise, for if we denote the mapping 
s—> s’ = sa of the group manifold on itself by t*(a), we find 
instead of (2.1) the equation 


t*(ba) = t*(a)t*(b). 


§ 3. Sub-groups and Conjugate Classes 


A sub-group 4g of agiven abstract group gis a set of elements 
contained in g which itself fulfils the characteristic group con- 
ditions: the unit element | belongs to g’, with a belongs also 
a”! and witha, balso ba. These three conditions can be reduced 
to the one: if a, b are any two elements of g’, then ba™ also 
belongs to g’. We assume, of course, that the partial system 
consists not merely of the element I], but the other limiting 
case, in which q’ coincides with g, shall be included under the 
concept of a sub-group. 

Examples are readily found. In the group of Euclidean 
motions are contained, for example, the group of rotations 
(which leaves one point, the centre, fixed) and the group of 
translations. The unitary transformations constitute a sub- 
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group of the complete group of all homogeneous linear transforma- 
tions; the even permutations a sub-group of the group of all 
permutations. If we are dealing with a transformation group 
®, all those transformations of @ which leave a particular 
point p fixed (1.e. which carry p over into itself) constitute a 
sub-group &,. Instead of a point p the fixed element may be 
any figure composed of points; the transformations of the sub- 
group must either leave the figure as a whole fixed (1.e. they must 
carry each point of the figure over into another such) or the 
more restrictive condition that they leave each point of the 
figure fixed. We can also obtain sub-groups of & by employing 
invariant functions instead of invariant figures. If P(p) is any 
function of position on the point-field with elements p we as- 
sociate with the transformation S:p-—> p’ the function wp’ 
defined by ’(p’) = d(p) and say that it is obtained from % by 
the transformation S. If p’ = Sp, p” = Tp’, the equations 
bp) = o(P') = o'(P") 
show that the composition of the transitions gow’ and 
pb’ -> ws’ associated with S and 7 result in the transition p > p" 
associated with 7S. Now consider all transformations S of & 
which carry #(p) over into itself, 1.e. for which P(Sp) = w(p) is 
an identity in p; they constitute a sub-group $ of @, and 
u(p) is an invariant of %. In this way we can separate out 
the rotations from the homogeneous linear transformations by 
requiring the invariance of the unit quadratic form. The sub- 
groups contained in a finite group g, which 1s described by 
exhibiting each of its elements and giving explicitly the result 
of composition of each two, can be obtained by inspection. 
There is associated with cach element a of the group g a 
cyclic sub-group denoted by (a) : 


5 an? am, a° Te I, a, a’, 3 "5 (3.1) 
the clements a" of which are defined inductively by the equations 
ae |. sae "a, 

These elements constitute in fact a group, for m and m being 
any integral exponents we have 

qntm = ara". 
(a) is the smallest sub-group which contains a, 1.e. 1ts elements 
are common to all sub-groups of g which contain a. The 
elements of the set (3.1) can either be distinct or—and this 
latter must be the case if g is a finite group—they must repeat 


themselves after a cycle of h terms: I, a, a7, ---, a! are 
distinct but a* == §. his called the order of the element a. 
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The order of a finite group is the number of its elements ; 
accordingly, the order of an element a agrees with the order of 
the cyclic sub-group (a) generated by a. A group is said to be 
commutative or Abelian if composition of its elements obeys 
the rule ba = ab. Cyclic groups are therefore Abelian. 

If a runs through the sub-group § of g the associated (left-) 
translations f, constitute a group of transformations which 1s 
simply isomorphic with §, the point-field of which is the group 
manifold. We say that two elements s, s’ which are equivalent 
with respect to this transformation group are (left-)equivalent 
with respect to and express this situation by the notation 
‘5’ = 5 with respect to )’’; the condition for it is that s’ = as 
where a is an element of §. In this way the elements of g are 
divided into sets of elements which are equivalent to §. If 
the number of such sets is finite, it 1s called the index of § in g. 
If g is a finite group the number of elements in each of these 
sets is given by the order of §, fer different translations t, send 
s into different elements: as + bs if a +b. The order of } 1s 
accordingly a divisor of the order of g, and the quotient of these two 
is the index of h. 

The considerations at the end of §2 above, which were 
developed for groups of transformations, suggest a second 
realization of the abstract group g. We associate with the 
element a the correspondence 


s—> $s = asa™} (3.2) 


of the group manifold on itself. Vhis correspondence, which 
we call the ‘‘ conjugation’”’ f,, 1s reciprocal one-to-one, and has 
as inverse s = a's’a. The law of composition is obeyed, for 
from 

s>s' = asa, s'’—>s" = bs'b} 


we obtain the product 
s” = basa“! 6} = (ba)s(ba)™!. 


Two elements s, s’ of g are said to be conjugate if they are 
equivalent with respect to the group of all conjugations. Ac- 
cordingly, the whole group is divided into classes, any element 
of one of which is conjugate to any other element of the same 
class. When we speak of classes within a group without a 
more explicit description we mean these conjugate classes. 

The realization of q by the group of conjugations is in general 
a ‘‘ contracted ” rather than a faithful realization. In particular, 
the conjugation f£, coincides with the identity if @ commutes 
with all elements s of the group. The totality of all such ele- 
ments a is called the central of the group; it is obviously 
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an Abelian sub-group of g. But this disadvantage of con- 
jugation over translation ts offset by an advantage ; conjugation 
is an isomorphic correspondence within the group itself which 
leaves the unit element invariant and which associates with 
each sub-group } of g another such, the conjugate sub-group 
aha~!. These facts, which are expressed by the equation 


a(st)a~! == (asa~})(ata~), 


were already contained implicitly in the considerations at the 
end of § 1. 1s said to be a self=conjugate or invariant 
sub-group if it coincides with all its conjugate sub-groups. 
The importance of this last concept is best seen in the 
following: 
Theorem. If isaninvariant sub-group and = denotes equtva- 
lence with respect to it, then it follows from 


s'=s,lU=t that s't'=st. (3.3) 
To prove this we note that s’ == as, t) = bt (a, b in §) yield 
s't’ = asbt = (ac) (st). (3.4) 


for c = sbs™ belongs to h with b. Since ae lies in § our assertion 
is proven. It is readily seen that the invariantive nature of § is 
necessary as well as sufficient for the validity of (3.3). In deal- 
ing with an invariant sub-group ) we need not distinguish 
between right and left equivalence with respect to h—indeed, 
the above proof was based on this fact. 

We may, if we like, consider equivalent elements as not 
differing from one another (by application of the principle of 
definition by abstraction); but by thus allowing equivalent 
elements to fall together the group property of q 1s, in general, 
forfeited. In accordance with the above theorem it still remains, 
however, if § is an invariant sub-group. The group obtained 
from g by identifying all elements which are equivalent with 
respect to h is called the factor group q/; its order is the 
index of the invariant sub-group § of q. 

These concepts are of assistance in examining the way in 
which a group may be “ contracted ”’ on setting up a realization. 
Let the transformation 7(a) of a given point-field on itself 
correspond to the element a of the abstract group g in the realiza- 
tion under consideration. Then T(a) = 7(a’) if and only if a’ 
is obtained from a by composition with an element e (1.e. a’ = ea) 
for which T(e) is the identity. Such elements e obviously con- 
stitute a sub-group § of g, for it follows from 


T(e) =I, T(e’)=J] that Tee’) = T(e)T(e’) = I. 
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h is, in fact, an invariant sub-group, for if T(e) is the identity, 
the same is true of 


T(aea~“') = T(a)T(e)T~'(a) = T(a)T~ (a). 


In any realization of an abstract group g by a group of transforma- 
tions the elements of a certain invariant sub-group } of q correspond 
to the identical transformation ; two different elements will be 
associated with the same transformation 1f and only if they are 
equivalent with respect to h. The group of transformations 1s 
consequently a faithful realization of the factor group q/). 


§ 4. Representation of Groups by Linear 
Transformations 


On requiring that the transformations which are to serve 
as a realization of a given abstract group g be linear and homo- 
geneous we arrive at a problem which is most fruitful from the 
mathematical standpoint and which 1s at the same time of 
greatest importance for quantum mechanics; we then speak of 
a representation, instead of a realization, of the group. An 
n-dimensional representation of g, or a representation of degree 1, 
consists in associating with each element s of the group an 
affine transformation U(s) of the n-dimensional vector space 
R= RM, in such a way that these transformations obey the 
law of composition 


U(s)U(t) = U(st). (4.1) 


We then say that s induces the transformation U(s) in the 
representation space §. On choosing a definite co-ordinate 
system in §R each transformation U/(s) is represented by a square 
matrix of m rows and columns, the determinant of which does 
not vanish. On replacing the original co-ordinate system by 
another, obtained from it by the transformation A, the corre- 
spondence which was formerly represented by the matrix U(s) 
is now represented by the matrix AU(s)A~!. Consequently if 
the association s + U/(s) is a representation, the association 


s—> AU(s)A 


is obviously also one; this latter representation is said to be 
equivalent to the former. They are essentially the same, 
differing only in the choice of the co-ordinate system in terms 
of which they are described. 

Examples.—A representation in one dimension consists in 
assigning to each element s of the group a non-vanishing number 
x(s) in such a way that 


x(st) = x(s) x(¢). (4.2) 
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In particular, x(f) = 1. A most trivial 1-dimensional repre- 
sentation 1s obtained by assigning to each s the number 1: 
x(s) = 1. This special case is called the zdentical representation. 

Consider next the so-called symmetric group, the group 
a == my, of all f! permutations of f things. The association 


s>6,= +1, 


according as s 1S an even or an odd permutation, defines a 
l-dimensional representation, the “ alternating ’’ representation 
of the group aw. For the character 6,, which distinguishes 
between the even and the odd permutations, satisfies the 
equation 


Oat a 0, : Oy. 


Let g be a finite cyclical group of order h; the elements 
s are then 


2 «6 ~ & h-1 
I, a, a?, ,a 


and a, == 1. Consider the 1-dimensional representation s — y(s) 
in which y(a) =e. The condition (4.2) for a representation 
then tells us that to the elements s of this series correspond 


2 > ee e@ h-l 
126"; ,€ 3 


and that to a* corresponds e*. Hence e4 = 1; &€ must therefore 
be an h'® root of unity and the law defining the representation 
isa’ —> e" (y= 0, 1,2, + - +). Conversely, when € 1s an arbitrary 
hk root of unity this association defines a 1-dimensional re- 
presentation of g. We have thus obtained a complete survey 
of all possible 1-dimensional representations of a cyclical group. 

The only example of a multi-dimensional representation 
which we offer at this time is the following trivial one. If 
g is itself a group of linear transformations of an 2-dimensional 
vector space R, then the association s -> s defines an »-dimensional 
representation of g. This example implies more than one might 
at first sight imagine. We have in fact to do the following : 
we first obtain the structure of the group g by abstraction from 
the group of linear transformations and then return to the 
original realization by means of the correspondence s— s 
between an element s of the abstract group on the one hand 
and the linear transformation s on the other. 

The concept of equivalence has a more general significance 
than that discussed above. It may refer to an arbitrary system 
& of linear correspondences U of the m-dimensional vector 
space . We need not assume that these correspondences 
possess an inverse (i.e. that they have a non-vanishing deter- 
minant), nor need we assume that they are associated with 
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the elements s of a group, as 1s the case with representations. 
On expressing the set of correspondences U in terms of a new 
co-ordinate system each matrix U goes over into the matrix 
U’ = AUA™',; the system 2 is transformed into the equivalent 
system &” consisting of the U’. A is here a fixed non-singular 
matrix. 

Consider a correspondence U of §t on to itself. A linear 
sub-space R’ of Rt is said to be invariant under U if the vectors 
of R’ are transformed into vectors of R’ by U. If R’ is invariant 
then the space ® (mod. 9’) obtained by projecting R with 
respect to ft’ is also invariant (cf. J, § 2, in particular Fig. 1). 
R’ being invariant, U gives rise to a correspondence U’ of § 
on to itself; we say that U induces U’ in ®’. Similarly for 
the space obtained by projection. We now pass from a single 
correspondence U to a system & of correspondences. t’ is 
said to be invariant under » if it is invariant under each corre- 
spondence U of 2. Describing * in terms of a co-ordinate 
system which is adapted to the invariant sub-space ®’, all 
matrices U of the system 2 reduce simultaneously to the form 
illustrated in Fig. 1, p. 8. 2 is called trreducible if % con- 
tains no sub-space, other than & itself and the space 0 consisting 
only of the vector 0, which is invariant under 2. We shall 
have occasion to reduce # in such a way that each constituent 
separated off is irreducible under a given system &. This 
requires the construction of a series of sub-spaces 


0, Ri, Re, meee’ R, —F R, (4.3) 


beginning with 0 and ending with ®, in which each member 
is contained in the preceding one and is such that R; (mod. R;_;) 
is irreducible. Naturally R; shall actually be larger than R;_,, 
not merely coincide with it. The implications of this reduction 
are most readily seen in terms of the matrices U of the corre- 
spondences of the system 2 on adapting the co-ordinate system 
to the “ composition series’’ (4.3), 1.e. by choosing first a co- 
ordinate system in §,, then supplementing it with additional 
fundamental vectors in order to obtain a co-ordinate system 
for ,, Rs, ° °° in turn. 

2' is said to be completely reducible if R can be decomposed 
into two sub-spaces Rt + §’, each of which are invariant under & 
and such that neither of them consists merely of the vector 0. 
This concept of complete reducibility is more exacting than that 
of mere reducibility. On describing ® in terms of a co-ordinate 
system which is adapted to this decomposition, each matrix 
U of 2' assumes the form illustrated in Fig. 2, p. 9. We are 
then faced with the problem of decomposing ® (or 2) into 
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constituents, none of which is completely reducible, i.e. of 
decomposing = f,-+° +++ 8, into invariant sub-spaces, 
none of which is completely reducible. 

We often find that reducibility implies complete reducibility, 
1.e. that in many cases we have the theorem: If §’ is an in- 
variant sub-space of 8, a second invariant sub-space §R’’ can 
be found such that ® is completely reducible (with respect to 
2X) into Wt’ + KR". We shall soon see that this is actually the 
case when ® is a unitary space and 2 is a system of unitary 
transformations. 

It was shown in Chap. J, § 3, that if the system 2 is re- 
ducible, then the system 2* of ‘‘ transposed ’’ correspondences 
of the dual space on itself is also reducible. If §:s5— U(s) 
is an n-dimensional representation of the group g the transposed 
U*(s) do not constitute a representation; it is readily seen, 


however, that on employing instead the contragredient corre- 
spondences 


U(s) = [U%(s)}3 


we do obtain a representation s > U(s) of the dual vector space. 
Ws 


This we call the contragredient representation §). 


§ 5. Formal Processes. Clebsch-Gordan Series 


Continuous groups offer what are perhaps the simplest 
examples of the theory of representations. We consider in 
particular the group ¢ = ¢, of all linear and homogeneous trans- 
formations s in 1 variables x,, %2, °°‘, %, With non-vanishing 
determinants ; we consider each set of values x; as a vector 
in an »-dimensional vector space t= 1,. The classical theory 
of invariants, first developed in England about the middle of 
the last century, concerned itself in particular with the repre- 
sentations of ¢ induced on the coefficients of arbitrary forms 
in the variables x; A quadratic form in these variables is a 
linear combination of the n(2-+ 1)/2 linearly independent 
products x,;%,; under the influence of a linear transformation 
s of the x; these products undergo a linear transformation [s],, 
and the correspondence s-—> [s], is obviously a representation 
[c]? in 2(m + 1)/2 dimensions of the group¢. The transformation 
s of the variables x, sends the arbitrary quadratic form 


a, sy, H 5 Xr 
into a quadratic form 


’ ror 
D> Ay x; XE 
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in the new variables, where the coefficients a;, are obtained 
from the a;, by a certain linear transformation s, associated with 
$; S, is obviously contragredient to [s],. The quadratic form 
characterized by a fixed set of n(n + 1)/2 coefficients a,, may 
thercfore be considered as a vector in a space of this number of 
dimensions, and the transformation s of the variables x; induces 
the transformation s, in this space. The space thus defined by 
the totality of m-ary quadratic forms is thus the point-field for 
a group of linear homogeneous transformations which constitute 
a representation of the group ¢. 

We may in the same way deal with cubic, quartic, > °°, 
f-ic forms. The totality of monomials of order f are contained 
in the formula 

xf xfs ows xfn (5.1) 


where the f; are non-negative integers whose sum 


fi + fe2+° . ‘+ f,=f. 


They constitute the substratum of a representation [c]/ in 


ni) nati d)--+(n+f—1) 
7] 7 122 Sf 
dimensions. 

But we can exhibit representations of ¢ which are formally 
yet simpler than these arising from the theory of forms. Let 
(x,) and (y,) be two arbitrary vectors in our n-dimensional 
space t and consider the products x, y,. On subjecting the x, 
and the y,; to the same transformation s of ¢ (transition to a 
new co-ordinate system) the mu? products undergo a certain 
linear transformation s X s associated with s and the corre- 
spondence s > s X s 1s an n?-dimensional representation (c)? of ¢. 
Now a system of numbers F(z, k), depending on two indices 1, k 
which run through the values 1, 2,+ - -, ”, is said to be a tensor 
of second order if under the influence of a transformation s of 
t the F(z, k) undergo the same transformation as the products 
x; yx of the components of two arbitrary vectors r, ) of r. Hence 
the tensors of order 2 are the substratum of the representation 
(c)? of c. (¢)? contains the representation [c]? which is induced 
in the sub-space of symmetric tensors of order 2; the tensor 
with components F(z, k) being symmetric if F(ik) = F(kzt). 

In geometry the anti-symmetric tensors, i.e. tensors whose 
components satisfy the condition P(ik) = —- F(ki), play a more 
important role than the symmetric ones.~ In particular, two 
arbitrary vectors (x,), (y,) define a surface element with 
components 

x{tk} = x_n — XEN; | 
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of these quantities but n(m — 1)/2 are linearly independent, 
say those for which 7 <k. On subjecting the components x, 
of the vector r and the components y; of the vector y to the 
same linear transformation s, the components of the surface 
element defined by them undergo an n(n — 1)/2-dimensional 
linear transformation {s},. s-—> {s}, is a representation {c}? 
whose substratum is the totality of anti-symmetric tensors of 
order 2. Hence the representation (c)? is reduced into the 
representations (c]? and {c}?, for any tensor /*(2k) can obviously 
be written 


F(ik) = s{F (ik) + F(ki)] + 5(F GR) — F(Ri)) 


i.e. In a unique manner as the sum of its symmetric and anti- 
symmetric parts. That this reduction is correct 1s further borne 
out by the fact that the dimensionalities satisfy 


pa Ee a 
n 9 + 5° 
Similarly three arbitrary vectors r, ), 3 determine a 3-dimen- 
sional element of volume with components 


My XE X 
a? ae 2 


These clements constitute the substratum of a representation 


7 n n(n — 1)(n — 2) 
{3| i oot aed 


dimensions. Continuing in this way we can construct 4-, 
h-, + - + n-dimensional elements; this process must cease with 
n-rowed determinants, for a determinant of the form (5.2) with 
more than ” rows must necessarily vanish identically. 

We shall see that the representations of c whose substrata 
are the symmetric and anti-symmetric tensors of order f are 
irreducible, and shall in fact solve the general problem of effect- 
ing the complete reductions of (c)f, the representation induced 
by c in the space of all tensors of order f, into its irreducible 
constituents (Chap. V). 

The tensor concept really depends on the x -multiplication 
introduced in II, §10. If the m variables x; undergo a trans- 
formation A and the x variables y, a transformation B, then 
the mn products x,y, undergo a transformation A xX B. Con- 
sidering the x, as the components of an arbitrary vector Y in 
an m-dimensional space ®,, and the y, as the components of 
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y in R,, the products x; y, may be considered as the components 
of a vector r X ) in an mn-dimensional vector space R,, X Ra. 
Hence two representations 


H:s—> U(s), 9':5—> Us) (5.3) 


of g in m, n-dimensions, respectively, give rise to a new mn- 
dimensional representation which we denote by § x §’: 


) x H':s—> Us) x Us). (5.4) 


This presents a general method of obtaining a new representa- 
tion © X §’ from two given representations §, §’. 

Denoting the representation s—>s of the linear group c for 
the moment by (c), the representations of ¢ whose substrata 
are the tensors of order 2, 3, - ++ are then (c) X (c) = (c)?, 
(c) X (¢) X (Ce) = (C8 +. 

We should, perhaps, have discussed the addition + of two 
representations before discussing their multiplication <. Con- 
sider the variables x; and vy, as the components of a single vector 
4 in an (m-+ n)-dimensional vector space; when the x, are 
subjected to the transformation A and the y, to the trans- 
formation B these m-+ mn variables undergo a certain trans- 
formation (A, B). Hence we obtain from (5.3) the representation 


§ X §':s5— [U(s), U'(s)] 


in m-+ m dimensions. The inverse of this process 1s complete 
reduction, as discussed above: § + §’ is completely reducible 
into the components § and h’. 

Another important formal method is the following: Any 
representation J’ in N-dimensions of the linear group c, in 
n-dimensions may be used to construct an N-dimensional 
representation of any abstract group g from an n-dimensional 
representation § of the same. J" associates with the linear 
transformation u in n-dimensional space a linear transformation 
U in N dimensions, so if §:s-—> wu is an n-dimensional repre- 
sentation of the group g with elements s, then 


s>u>r U 


is an N-dimensional representation s—> U of gq which we may 
denote by IO). To this is due the importance of the repre- 
sentations of the linear group for the general theory of repre- 
sentations. For example, take I” to be the representation of 
¢ whose gubstratum is the dual space, the space of all tensors 
of order 2, of the symmetric or anti-symmetric tensors of order 2, 
etc.; we then obtain from the representation © of the abstract 


group g the representation %, © x ©, [D * OI, {D x H}, ete. 
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The three most important formal processes are (1) addition, 
(2) x-multiplication, and (3) the I” process. The first two 
generate a new representation from one or two given repre- 
sentations, the third a new one from a given representation. 
The first two are completely circumscribed, but the third 
contains a general method, for £’ may be any representation of 
the linear group Cy. 

If g’ is a sub-group of g, then any representation 
§ :s—» U(s) of gq contains a representation of g’; we need only 
let the element s run through the sub-group g’! This too may 
be considered as a formal process (4) which generates a repre- 
sentation of g’ from a given representation of g. 

The X-multiplication occurs in yet another connection. 
Given two groups g, g’, we can consider the pairs (s, s’), the 
first member s of which is an element of g and the second s’ 
an element of g’, as the elements of a new group g X q’, the 
direct product of g and gq’, obeying the multiplication law 


(s, s’)(t, 2’) = (st, s’t’). 


The order of g X g’ is the product of the orders of g and q’. If 
§:s-—> U(s) is an n-dimensional representation of g and 
§)’ : s’ -» U'(s’) an n'-dimensional representation of g’, then 


(s, s') > U(s) x U'(s') (5.5) 


is obviously a representation in nz’ dimensions of the group 
q xq’; we denote it by § X ’ (with a boldface x). This 
construction may be broken up into two steps. First introduce 
the representation 


(s, s’) > U(s) 


of q x g’; there is no reason why we should not designate it 
by the same letter as the representation s > U(s) of g—we are 
accustomed to calling the function f(x), considered as a function 
of the two variables x, y, by the same letter as the function 
f(x) of the single variable x. U(s) and U’(s’) are thus to be 
considered as functions of the same variable pair (s, s’), and then 
the representation ) X $' of g x g’ may be obtained by ordinary 
X-multiplication from and §’. The differentiation between 
boldface % and ordinary X is accordingly purely pedantic. 


Examples. Unimodular Group in Two Dimensions 


Let g = c¢ = c, consist of all linear transformations s of two 
variables x, y: 


x =ax-+ by, y' = cx + dy, (5.6) 
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whose determinant ad — be = 1 (‘“ unimodular” linear trans- 
formations *). A homogeneous polynomial in x, y of order f is 
a linear combination of the f-+ 1 monomials 
A dV ys so, wy, yl (5.7) 

Under the influence of s they undergo a linear transformation 
which we denoted above by [s];; they constitute the substratum 
of a representation [c]/:s— [s], in f + 1 dimensions which we 
now denote by Gy. Gy is, although we have yet to prove it, 
irreducible. 

We can restrict ourselves within ¢ to the sub-group c, of 
‘‘ principal’’ transformations which transform each of the 
variables separately : 


, 


an, oS “y, (5.8) 


where a + 0 1s an arbitrary constant. c¢, 1s Abelian. This 
transformation multiplies the mcnomials of the set (5.7) by 


al qi-2 ce a~(f-2), af, 


On associating the number a’ with the element (5.8) of c, we 
obtain a 1-dimensional representation which we denote for the 
moment by ©); here r can be any fixed integral exponent. 
We have just seen that the irreducible representation Gy of cy 
is completely reduced on restricting ourselves to the sub-group 
¢, into f+ 1 one-dimensional representations ©) with r = f, 
f—2,-++,—/f. This is an example of the process (4). 

As an example of multiplication and addition we consider 
the problem of reducing the product Gy x ©, of the two repre- 
sentations @,, ©, of ¢ into its irreducible components. The 
result is contained in the formula 


| 
| CO, X 8 = 2, (5.9) 


where v runs through the series 
Ju=ftef+e-%---+|f—al | (5.10) 


without repetition, decreasing by 2 from term to term. This 
equation is essentially identical with the Clebsch-Gordan series 
which plays such an important rdle in the theory of invariants 
of binary forms. We shall see in the succeeding chapters that 
it may justly be considered as the fundamental mathematical 


* ¢, will usually denote the group of all non-singular linear transformations 
in #-dimensions; it will however occasionally be used to denote the more 
restricted unimodular group, in which case the restriction will be explicitly 
stated. 
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formula for the classification of atomic spectra and for the theory 
of the valence bond. 
The proof consists in showing that 


Se X © = Cry + Opa X Gy-1), (5.11) 


for (5.9) then follows by mathematical induction and the fact 
that obviously 


Cy x G, == Cy. 


A new co-ordinate system for the representation space of Gy 
is obtained by replacing the basis (5.7) of homogeneous poly- 
nomials of order f by another basis. In this sense we can say 
that the polynomials of order f constitute the substratum of 
the representation ©, The substratum of the representation 
©, x ©, is then the totality of polynomials 


b= Pxy ; En) 
depending on the components of two arbitrary vectors (vy), 
(€), homogeneous and of order f in the first, and homogeneous 
and of order g in the second ; we write the total order f + g = h. 
The ® are thus linear combinations of the (f+ l)(g + 1) 


monomials 
xtyk-&ye where 1+k=fpetnw=g. (5.12) 


Both vectors are transformed cogrediently under the same trans- 
formation s, (5.6). The problem consists in completely reducing 
the space of the polynomials ® into two sub-spaces (®)) and 
(®)’ which are the substrata of the representations ©, and 
Gr, XG, respectively. We first discuss the structure of 
these two sub-spaces. 

(D),. Expand 


(ax + By)(ak + Bn)? = ah dy + (p)ah MB dy +o + BEd 
(5.13) 


in powers of the undetermined coefficients a, B. The 
$; = $,(xy ; &n) are special polynomials of the type ® and span 
the sub-space (®)y. We must now show that this sub-space 1s 
invariant under the transformation (5.6) of the variables ; 
i.e. that ¢; = ¢,(x'y’; én’) is a linear combination of the 
@; = $,(xy; én). It is clear that if this is the case then ¢ in- 
duces the representation ©, in (®)9, for on identifying the two 
vectors 


f=*x, n=y¥ (5.14) 
dp; becomes 
pi(xy ; xy) = ait yl, 
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Hence we are certain a priori that the h + 1 functions ¢,; are 
linearly independent. 
In order to arrive at the desired proof we replace x, y in 
(5.13) by 
x’ = ax-+ by, y’ =cx + dy, 
and in the same way €, 7 by 


E’= a + bn, 9 = c& + an. 
Now note that ax’ + By’ is the linear form 
(aa + Bc)x + (ab + Bd)y = Ax + By 
in x and y; hence 
(ax" + By')S(ag" + Bn’)? = (Ax + By)I(AE + Bn)s, 
and by (5.13) 
h A-iQi. df’ — hA-i Pr. 
E(j)0 dim EAB 4. 
On replacing A, B on the right-hand side of this equation by 
A=aa+ Bc, B= ab-+ Bad, 
and equating coefficients of «*-'B', we obtain ¢, as a linear 
combination of the ¢,. 
(®)". The substratum of the representation @,, x ©, 
consists of the polynomials 
f= P(xy 5 &n) 
of order f — lin (x, y) and of order g — Lin (€, 7). They are not 


polynomials of type ®; in order to increase the order in the 
components of each vector by 1] we replace cach such ¥ by 


® = (xn — yb) > P. 
The factor thus introduced in no way affects the representation. 
The last step in the proof consists in showing that the total 
space of polynomials ® is completely reducible into these two 


sub-spaces; i.e. in showing that any polynomial ® can be 
written in the form 


D = (AyPo + AP: + ° + * + Aaa) + (xn — ye) (5.15) 
with unique constant coefficients a, (The development in 
terms of powers of the determinant xy — y& obtained from this 
by induction ts the Clebsch-Gordan series.) First, the dimen- 
sionalities are correct, for 

a Ee dy Pg al) ee 
Hence it suffices to show that the various terms in (5.15) are 
linearly independent, t.e. that an expression of the form (5.15), 


h 


L 
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n which ¥ 1s a polynomial of order f — 1 in (x, y) and of order 
*— ] in (€, 7), can vanish only if WY vanishes identically and if 
ll the coefficients a; are zero. The proof is extremely simple. 
Ne first let (&n) = (xy) as in (5.14), then the equation © = 0 
yecomes 


Agr" + ax ly + - . + apy’ = 0 


dentically in x and y; hence a; = 0. MHaving established this 
ve return to the two sets of variables xy; &y and obtain the 


‘quation 


rom which it follows that W == 0—in an algebraic identity 
or polynomials we may always remove a factor, such as 
cn — y&, which does not vanish identically. 

Our formula (5.9) also holds for the group c of all linear 
‘ransformations of x, y with non-vanishing determinant. We 
nust then interpret ©,, v = h — 2/ in (5.9) as that representation 
whose substratum 1s the totality of homogeneous polynomials 
of order v in x and y multiplied by (vy — yé&). In other words, 
-he new GW, differs from the old in that the transformation of 
che (v + 1)-dimensional representation space corresponding to 
; in the representation ©, is to be multiplied by the J“ power 
of the determinant ad — be. 

©, xX ©, is a representation of Cy X Cg, the group consisting 
of pairs (s, s’) whose members s and s’ run independently through 
the entire group ¢,. On introducing the restriction that s’ is 
the element S$ obtained from s by replacing the coefficients of 
the linear transformation s by their conjugate complex, ©; & @, 
becomes a representation Gy; , of ¢,, the substratum of which 
may be taken as the monomials 


xtyk wy (i+ k=f, e+e =g) 


of order f in (x, y) and order g in (%, ¥). It can be shown that 
Sy, is also irreducible. 


§ 6. The Jordan-Holder Theorem and its Analogues 


Perhaps the most fundamental theorem of mathematics 1s 
that on which the concept of cardinal numbers depends. Let 
the members of a finite set of objects distinguished by marks 
a,b,c* + + be exhibited individually in this order and associated 
with the symbols 1, 2, - ++”. The theorem then states that 
the ‘‘number”’ 2 is independent of the order in which the 
objects are exhibited. The proof of this theorem is of con- 
siderable mathematical interest and offers the simplest example 
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of the type of proof employed in establishing the Jordan- 
Holder theorem. A new enumeration consists in associating 
the symbol 1 with any one of the objects, the symbol 2 with 
any one of the remaining objects, etc., until the entire set is 
exhausted, the last object receiving the symbol n’. We assert 
that n’ = n. 

The proof is divided into two steps. (1) If in the new enumer- 
ation the symbol 1 is associated with the same object a as in 
the old, our theorem for the series from 1 to » is reduced to that 
for the series from 1 to 2 — 1. This is immediately evident on 
discarding the object a and reducing by one the symbols as- 
sociated with the objects 0, c, - + + in the new as well as in the 
old enumeration. (2) If, on the other hand, the symbol 1 1s 
associated with one of the other objects b, c, - + + then in the 
new enumeration the object a is associated with some symbol 
1 contained in the series 2, 3, - + +, »’. We now introduce a 
third enumeration which enables us to make the transition 
between the first and the second by interchanging the symbols 
l and 2 in the second enumeration. The number 7’ 1s obviously 
unaltered by this process. But we have now introduced an 
equivalent enumeration in which the object a is associated with 
the same symbol 1 as in the original and have reduced the 
general case to the one considered in (1) above. The proof of 
the theorem then follows immediately by the method of 
mathematical induction. 

As an auxiliary result of these fundamental considerations 
we have the theorem that any permutation can be obtained by 
the successive application of transpositions. 

The fFordan-Holder theorem is concerned with an abstract 
group g. An invariant sub-group gq’ of g which does not coincide 
with g itself is said to be maximal if there exists no invariant 
sub-group of q—except g’ and g-——containing g’. The factor 
group @/g’ is then simple, i.c. it contains no invariant sub-group 
with the exception of itself and that consisting only of the 
unit element |. As was recognized by Galois, the so-called 
composition series 


Bo = G, G1, Ga, ° °°) Drea, Ge = [ (6.1) 
is of fundamental importance for the solution of algebraic 
equations. This series begins with g and ends with ], and each 
member is a maximal invariant sub-group of the preceding 
member. We assume that the composition series terminates ; 
this is naturally the case for finite groups, as the order necessarily 
decreases from term to term. The successive factor groups 


9/91, G1/B2, ae Gr—1/Gr ca Qr—y (6.2) 
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are simple. The Jordan-Hélder theorem asserts that the 
structure of these factor groups, except for the order 1n which they 
appear, 1s uniquely determined by q. 

Consider, therefore, a second composition series 


So = G Gi Gar * 
of the same group g; it is to be compared with the ‘“ standard 
series’ (6.1). The proof of the fact that this new series also 
contains exactly ry + 1 terms and that the corresponding factor 
groups are, except for the order in which they occur, isomorphic 
with the factor groups (6.2) is again accomplished in two steps. 

(1) If the two second members Q’,, g, coincide, the theorem 
for the group g, whose standard series contains 7 -+ 1 members, 
is reduced to the corresponding theorem for the group g,, whose 
standard series contains but r members. 

(2) If g, and g’, do not coincide we construct the inter- 
section of g, and g’;, i.e. the set consisting of all elements 
common to the two. h is then an invariant sub-group of 9’, 
and, as we shall prove, g’,/h is isomorphic with g/g,. That 
two elements s, ¢ of 'g are equivalent with respect to q,, 1.e. that 
they belong to the same “ set,’’ 1s expressed by the equation 
t= a,s where a,ising,. Ifsandt#are at the same time clements 
of the sub-group q’,, then a, is also in g’, and consequently it 
is an element of h. We may therefore consider as the elements 
of g/,/h those sets in g which contain an element of g’, The 
elements contained in these classes then constitute an invariant 
sub-group § of g containing both g, and Qg’,, and g’,/b is simply 
isomorphic with §/g,. But since g’, is maximal either ) = 
or § = @q’,. The second case implies that g, 1s contained in q’,, 
and since it is maximal it must coincide with q’,, contrary to 
assumption. Hence § coincides with g and our assertion 1s 
proved. The intersection } of g, and q’, depends symmetrically 
on both, whence g/g’, and g,/§ are also simply isomorphic. 

We now proceed as follows. We construct a composition 
serics for }, which we denote simply by b, - - :, and compare 
the following four composition series of q: 


q, J1, Ja, ee 
g, Qi, h, me" 


6 Gy Gest’ 
The comparison of the first and second series is reduced to case (1). 


The second and third series agree from the member f) on, and 
the two foregoing factor groups 


g/G1, Gu/9 
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are, as we have seen, simply isomorphic with 


g/o'1, 9's/8 


on interchanging their order. The comparison between the 
third and fourth series is again reduced to the case (1). The 
proof of the theorem for composition series containing r+ 1 
members is thus reduced to the proof of the corresponding 
theorem for series with but r members, and since it obviously 
holds for r = 2 (i.e. for simple groups) the method of mathe- 
matical induction establishes its general validity. 

The close methodological agreement between the construction 
involved in the proof of this theorem and that involved in the 
proof of the independence of the cardinal number of a set of 
the order in which the objects are enumerated 1s immediately 
evident. 

FE. Noether* has given a generalization of the Jordan-Holder 
theorem which is of importance for us. A correspondence 
s—»> s’ = As of the group on itself is said to be automorphic it 
multiplication is invariant under it, 1.e. if (st)’ = s’t/—we here 
neither assume that different elements s generate different 
elements s’ nor that for a given element s’ there exists an element 
s such that s— s’ in virtue of the automorphism. Let 2 be 
a system of such automorphic correspondences of g. We now 
admit only sub-groups of g which are invariant under 2’, i.e. 
sub-groups whose elements are carried over by all operations 
of the system 2’ into elements of the same sub-group. We say 
that two such “ allowed” sub-groups g, and gq, have the same 
structure if we can set up a one-to-one simple tsomorphic 
correspondence between the elements of the one and the ele- 
ments of the other in such a way that every operation A of 
the system 2 sends corresponding elements of the two sub- 
groups over into corresponding elements. The Jordan-Hoélder 
theorem still holds under this modification; its proof can be 
aken over unaltered. 

The vectors of an n-dimensional vector space ®t constitute 
an Abelian group whose multiplication ts the addition + of 
vectors. We must for the moment supplement addition by 
the operation of multiplication of a vector by an arbitrary 
number; hence the concepts and theorems applying to vector 
space are not truly specializations of the concepts and theorems 
of Abelian groups, but there exists a thorough-going analogy 
between the two. Indicating this analogy between a group (on 
the left) and vector space (on the right) by ~ we have, for ex- 
ample, sub-group ~ linear sub-space, automorphism ~ linear 
correspondence. Indeed, a linear sub-space is a system §’ of 
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vectors such that with g and y their sum ¢ + ¥) and the product 
At by an arbitrary number XA also belong to ®’, and a corre- 
spondence ¢ > Y = Af is linear if it sends xy + Y and Az over into 
t’-+’ and Az’, respectively. Every ‘‘ sub-group” is here 
invariant, as we are dealing with Abelian groups. If ’ is 
a sub-space of R the space R (mod. R’) obtained by projecting 
R with respect to MR’ is the exact analogue of a factor group. 
A composition series consists of a sequence of spaces each 
member of which is a linear sub-space of the preceding one 
and has one less dimension. The last member is the space 0, 
consisting of the vector 0 alone, and the number of members in 
the series is 1 greater than the dimensionality n. The Jordan- 
Hélder theorem is here valid but trivial. 

On the other hand, this theorem 1s of considerable importance 
on going over to Noether’s generalization. Consider a system 
» of linear correspondences of the vector space on itself; the 
terms invariant, equivalent, reduction shall in the following refer 
to this system. Two invariant sub-spaces R, and §, are similar 
or equiyalent if a one-to-one linear correspondence {, 2 Y_ can 
be set up between the vectors of the one and the vectors of the 
other in such a way that any operation A of the system sends 
corresponding vectors over into corresponding vectors, On 
reading the series (4.3) established in § 4 backwards, we have 
the exact analogue of the composition series: each member of 
the series is followed by a maximal sub-space which 1s invariant 
under 2. (The possibility of constructing the composition 
series in increasing as well as decreasing order 1s due to the 
fact that the addition of vectors is commutative.) Furthermore, 
we can obtain the concepts and theorems relating to a system 
» of correspondences as genuine special cases of those of group 
theory, and not merely as analogues, by supplementing the 
system 2’ with all similarity transformations, 1.e. by all corre- 
spondences of the form y—> r’ = Ar representing multiplication 
by an arbitrary number A. The Jordan-Hdélder-Noether theorem 
now states: Given a second composition series 


0, Mi, R, ee a R, (6.3) 
the corresponding projection spaces 
Ht, Ie (mod. Rj), Ry (mod. Rj), - - > 


are equivalent to the projection spaces (4.3) 


Ri, Re (mod. Mt), Mt, (mod. Rt), eae 
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of the original series, taken in a suitable order. The number 
of members is, of course, the same in both. The reader is 
advised to reconstruct the proof of this theorem by carrying 
through the proof of the Jordan-Holder theorem step by step 
for this case. 

In particular, if the system 2 consists of the transformations 
U(s) associated with the various elements s of a group in a 
representation $ :s-—> U(s), our result yields the 

Uniqueness theorem: The irreducible representations separated 
off from & by successive reduction are completely determined by §, 
except for the order in which they occur, considering equivalent 
representations as the same. In particular, the complete reduction 
of § into irreducible components 1s unique, always considering 
equivalent representations as the same. 


§ 7. Unitary Representations 


For the case in which the representation space ® is unitary 
and the correspondences U(s) of ® on itself, associated with 
the element s of the group under consideration, are also unitary, 
certain of the concepts introduced above are to be modified 
accordingly. ‘Two representations 


s— U(s), s— U'(s) = AU(s)A™, 


are to be considered as equivalent only if A is unitary, ie. if it 
is a transformation from one normal co-ordinate system in 
R to another such. If ’ is a sub-space of R a unitary-orthog- 
onal co-ordinate system can be set up in §t’ and supplemented 
by additional fundamental vectors to form a complete unitary- 
orthogonal co-ordinate system for the entire space RR: every 
sub-space of a unitary space is per se unitary. Invariance and 
reduction remain as before, but we allow only those decom- 
positions of into two sub-spaces R, + MR, in which R,, R, 
are perpendicular. For a system of unitary correspondences 
reducibility implies complete reducibility and we have the theorem : 
If R is invariant with respect to & then R may be broken up into 
Ro + KR’ in such a way that R" is also invariant under X. We 
need merely to define Rt” as the space defined by all vectors per- 
pendicular to ®’. The theorem naturally holds for the case in 
which 2 is a system of infinitesimal unitary correspondences or, 
what amounts to the same, a system of Hermitian forms. The 
theorem developed in the preceding section proves that these 
irreducible components are uniquely determined, in the sense 
of (unitary) equivalence, to within a permutation. 
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Examples 
(1) The Unitary Group in Two Dimensions 


The group ¢ = ¢, of linear transformations in two dimensions 
contains the sub-group u=u, of unitary transformations. 
Hence the representation W, of ¢ obtained in § 5 1s also a repre- 
sentation of u. This representation 1s not unitary as it stands, 
but it can readily be made unitary by a slight change. The 
transformation of ©, corresponding to the unitary transforma- 
tion s of the co-ordinates x, y 1s that induced by s on the monomials 


x, = aye (i+ k=f) (7.1) 
of order f. For purposes of symmetry we label these co-ordinates 
with Sees n=1—k which runs through the values 


fif- + —f. This is also desirable because on restricting 
if ros to the sub- -group of “* principal transformations ”’ 


K+ EX, Yo =~y 


x, 1s multiplied by the factor e€". We now employ, instead of 
(7.1), the variables 


axryk 
Gp fee os (7.2) 
Vilk! 
obtained from them by multiplication with a constant. The 
representation W, of u will then be unitary, as follows from the 
equation 


We call ©; even or odd according as fis even or odd. The even 
representations associate the identity 1 with the reflection 


, Ul 


Me Se A YE oe ey 


and the odd associate with it the transformation — 1. Gy is 
also irreducible when considered as a representation of u, and 
on letting f assume the values 0, 1, 2,- + + they form a complete 
system of inequivalent irreducible representations of u. The proof 
of these assertions, which we employ heuristically in the follow- 
ing, will be given in Chapter V. On writing a homogeneous 
polynomial of order fin the variables x, y in the form 


LanXn 
the coefficients a, transform under the influence of a unitary 


transformation s like the components of a vector in the repre- 
sentation space of @,. 
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The complete reduction 
(Gy X ©) = Gyrg + Opa X G-1) 
was accomplished by breaking up the space of the ‘‘ polynomials 
@”’ into two invariant sub-spaces (®), and (®)’. We must 


now verify that these two sub-spaces are mutually orthogonal 
in the unitary sense. A general polynomial ® may be written 


Sawene co. \ a4 


where the x, are given by (7.2) and the &, are the corresponding 
monomials 


i= (te=g,e—K=>y»). 


Two such polynomials ® with coefficients a,,, 0,, are orthogonal 


if 
Dd 0e5 =O, 


The polynomial x;&,, whose highest coefficients as, = 1 while 
all others vanish, is to within a constant factor x/- £9 and is 
obviously perpendicular to all polynomials (®)’, for in all these 
latter the coefficient of x/£ vanishes. But under the unitary 
transformation 


six’ =ax+ By, y'’ = — Bet ay, (7.3) 
where ax + BB = 1, xé9 goes into 
(ax + By)I(ag + Bn)? (7.4) 


Since (®)’ and the orthogonality of polynomials are both in- 
variant under the unitary transformation s, (7.4) is also orthog- 
onal to (®)’ and, with the help of the definition (5.12) of (®)o, 
it follows from this that all polynomials of (®), are unitary- 
orthogonal to those of (®)’. 

(7.3) is the most general unimodular unitary transformation. 
This is derived in the same way as the familiar formula for the 
orthogonal transformations of two variables with unit deter- 
minant in plane analytical geometry. On writing the coefficients 


weg Benes (7.5) 


in terms of their real and imaginary parts we see that each such 
transformation is characterized by four real parameters x, A, p, v, 
the sum of whose squares 1s 1. The composition of two trans- 
formations s: («, A, p, v) is accomplished in terms of these 
parameters by Hamulton’s quaternion multiplication ; this latter 
led to the vector calculus. 
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(2) Unitary Groups in n-Dimensions 
The totality of tensors of order f is the substratum of an 
1/-dimensional unitary representation (u)/ of the group u = u,, 
‘or on denoting the components of an arbitrary tensor by 
P(t415 a 2) the sum 
ee: |F(iyte -_ iy) |? (7.6) 
(inns ig) 
is @ unitary invariant. On _ restricting ourselves to the 
{/-} dimensional linear manifold of anti-symmetric tensors we 


rake as the variables in tensor space those components 
F(tytg > + + tf) for which 243 <2g <---> <2, The sum (7.6) 
for these compénents only is, however, equal to the complete 
sum (7.6) divided by f!; hence the representation {ul of u, 
whose substratum consists of all anti-symmetric tensors, 1s 
unitary. The situation is somewhat different for symmetric 
tensors. The most general symmetric tensor of order f trans- 
forms like rXrx-+:-:xXdxz (f terms), ic. we may for the 
present purpose set 


ane he pte ge 1 f) — Xi, Xi. "oe oe Nips (7.7) 
We write the monomial on the right in the form 
xf ache « ee e@ xfn (5.1) 


as before; /, 1s the number of times the index r appears in the 
S€riCS 21, %, °° *, zy. In this sense we write the components of 
a symmetrical tensor 


F(tytg + + + t7) = Of, fo °° ty Sn): 


The sum (7.6) becomes in this case 


Se fs Baad Ae, 


extended over all integral f, 2 Ofor which ff + fe +°:: +f=f. 
The coefficient indicates how often the term |F (ii, ° + + @,)|? 
occurs in the sum in consequence of the fact that its value ts 
unchanged on permuting the indices. We must therefore 
consider the quantities 


Ve ae eae 


as independent components of an arbitrary symmetric tensor 
of order f in order to obtain a unitary representation [u]/. 
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The truth of this assertion follows from the fact that the special 
tensor (7.7) satisfies the equation 


xf o> e @ xJn ° zp o @ “gJn 


Arr a he 


alts apy Step XE)! = 2} 


We have already seen in I, §5 that a normal co-ordinate 
system can be so chosen that a commutative system 2 of 
unitary correspondences is completely reduced to a set of 
1-dimensional systems. The only «irreducible unitary repre- 
sentations of an Abelian group are accordingly 1-dimensional. 
For it follows from 


U(s)U(t) = U(st) (4.1) 


and the Abelian character of the group that the unitary matrices 
U(s) associated with the elements s are commutative. 

If © and §’ are unitary representations, then § + 9’, 
§ x §' are also.—The first fundamental problem for a given 
group g is to find a complete system of inequivalent irreducible 
unitary representations of g, for then any unitary representa- 
tion of g can be obtained by the addition of these irreducible 
representations. The second fundamental problem is to reduce 
the product § X §' of two irreducible representations {, 9’ of q 
into 1ts irreducible components ; or better (after having solved the 
first problem), to determine how often each of the irreducible 
representations occurs in this product. 

We illustrate these problems on the example offered by 
rotation groups, which are of particular importance in quantum 
physics. 


§ 8. Rotation and Lorentz Groups 


(a) The Group of Rotations in the Plane 


We describe the 2-dimensional plane by a complex co- 
ordinate x. The rotations of the plane are then given by 


X—> x = EX, (8.1) 


where € = e'* is a constant with unit modulus. (The rotations 
of the real 2-dimensional plane thus coincide with the unitary 
transformations of a single complex variable.) The angle of 
rotation ¢ determines the rotation completely, but it is of course 
only determined mod. 27 by the rotation. The angle of rotation 
behaves additively on composition: the rotation ¢ followed by 
the rotation ¢’ results in the rotation 6+ ¢’. This rotation 
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group 1s accordingly a one-parameter continuous Abelian group. 
We obtain a l-dimensional representation D™) of our rotation 
group 0 = Dd, by associating with the element e, (8.1), the linear 
correspondence 


Ly SS EP ey SS eit. (8.2) 


where m is any fixed integer. I assert that the D(™), m running 
through all integral values, constitute a complete system of 
irreducible unitary representations of D,. This can be seen as 
follows. 

Any irreducible representation 1s necessarily 1-dimensional : 
it associates with the rotation ¢ a number yx(¢) of absolute value 


1 such that 
x(o + $°) = x(P) + x(¢?). 


We assume that our representation 1s continuous; then x(@) 
is a continuous function of @ with period 27. First, y(0) = 1. 
We write x(¢) = e*4 and determine X(¢) uniquely by the require- 
ments that A(0) = 0 and that A(d) shall be a continuous function 
of d. We then have 


MP + 6’) = AP) + ALP’), (8.3) 


for the right- and left-hand sides of this equation could at most 
differ by an integral multiple of 27, but as it is written both 
sides agree for ¢’ == 0 and vary continuously with ¢’. (8.3) 
satisfies the condition A(O) = 0 and we obtain from it the further 


equations 
\M— $) =— AG), A(hd) = h- ALP), (8.4) 


where k is any integer. On replacing ¢ in the second of these 
equations by ¢/h we obtain 


a(2) = 7X4). (8.5) 


It follows immediately from (8.4), (8.5) that for every rational 
number k/h_ (k, h integers) 


A(t b) = 2A). (8.6) 


In accordance with our assumptions A(27) is an integral multiple 
Imm of 27. On setting d = 27 in (8.6) we obtain the equation 
A(f) = md for all ¢ which are rational fractions of 27; the 
continuity requirement then allows us to assert its validity 
for all real values of the argument ¢. 
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The simple equation 


Dim) x Dim’) —_ |(mtm’) 
is here valid. 


Consider the function f(p) on the unit circle in the complex 
x plane. If the point p goes over into the point p’ under the 
rotation ¢€, the function f goes into a function f’ which is defined 


by the equation 
I'(p’') = f(p)- 


The transition f— /’ is a linear correspondence in the oo-dimen- 
sional space of functions f(p) and is associated with the rotation 
¢; this obviously defines an co-dimensional representation of 
the rotation group D,, which we denote by &. S& is unitary if 
we take as the square of the absolute value of a ‘‘ vector’’ f 
the integral of |f(p)|? with respect to the element of arc dp on 
the unit circle. The fact that any function (satisfying suitable 
conditions) on the unit circle can be developed in a Fourier 
series means that in the reduction of & into its irreducible com- 
ponents each of the 1-dimensional representations 9") occurs 
once and only once. More precisely, this reduction is to be inter- 
preted with regard to the completeness relation. 


(b) The Group of Rotations in 3-dimensional Space 


We consider the functions f = f(P) on the unit sphere as 
the vectors of an oo-dimensional unitary space whose metric 


is given by \ LAP) Paden ; dw is the surface element of the sphere 


over which the integration is to be extended. If the point P 
goes over into P’ = sP under the rotation s, the function / 
goes over into the function f’ defined by /f’(P’) =f(P). The 
surface harmonics Y, of degree / [cf. II, § 4) obviously span a 
(2/ + 1)-dimensional sub-space ®, which is invariant under the 
totality of transitions f—> f’ induced in function space by the 
various elements s of the rotation group 0 = D,—here again we 
speak of this representation as &. They are consequently the 
substratum of a certain representation ®, of ) which is induced 
in ¥, by db. On choosing a definite direction as that of the 
Z-axis we may, as in II, § 4, take the set 


YY (m=11l—1,--+-+,—D 
as a basis for the surface harmonics of degree 1. We then have 


a unitary representation, and the sub-spaces ‘R, corresponding 
to the various values 0, 1, 2,- + - of / are mutually perpendicular 
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in the unitary sense (orthogonality properties of surface har- 
monics). 0 contains the 2-dimensional rotation group D,—e.g. 
as the sub-group of rotations about the z-axis. The structure of 
Y(™ shows that on restricting D,; to this sub-group Dd, the 
representation ®, 1s reduced into the 1-dimensional representa- 
tions D” for which m=l1, 1/—1,-°-°--°, —l The fact that 
any function on the unit sphere possesses a unique expansion 
in terms of surface harmonics means that on reducing & into 
its irreducible components each of the representations D,, 1 = 0, 
1, 2, - + +, occurs exactly once. This reveals the true signifi- 
cance of surface harmonics; they are characterized by the 
fundamental symmetry properties here developed, and the 
solution of the potential equation in polar co-ordinates is merely 
an accidental approach to their theory. 

Rotations are orthogonal transformations of three variables 
x, y, If we wish to include with the proper rotations with 
determinant + 1 also the improper ones with determinant — 1 
--““ augmented rotation group Dd’ '’—-this can be done by intro- 
ducing the reflection 


Ul , , 


tix’ = — xX, ypu-y vs=—2 (8.7) 


in the origin. Its reiteration 22 is the identity, and it commutes 
with all rotations. The matrix corresponding to it in the 
representation defined by the surface harmonics of degree / is 
the (27 + 1)-dimensional matrix (— 1)!, for the surface harmonics 
of degree 1 are homogeneous polynomials of degree / in x, y, 2. 
We can thus obtain two representations Dt, D> of the aug- 
mented rotation group from the representation D, of proper 
rotations; these two coincide with ®, for proper rotations, 
but in the first the matrix associated with the reflection 7 is + 1 
whereas in the second itis —1. Wecall this + 1 the signature 
of the representation. Hence in the oo-dimensional repre- 
sentation §& of the augmented group Dd’ each ®, occurs once 
with signature (— 1)’, but not with the opposite signature. 
Although we are not as yet in a position to prove it, the 
D, (t= 0, 1, 2, °° +) constitute a complete system of in- 
equivalent irreducible (single-valued) representations of the 
rotation group 0, and the 9°, D> together constitute such a 
system for the augmented rotation group 0’. 

Now consider the unitary function space of all functions 
f(P) in 3-dimensional space for which the integral |/|? over all 
space 1s finite. Let the representation induced in this space 
by rotations s, in which the transition from / to the transformed 
function f’ = sf is associated with s, be denoted by © Each 
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function f{(P) can be expanded in a series of terms of the form 
¢(r): Y,. Choose a complete orthogonal system ¢,(r), ¢.(r), * + ° 
in the domain of functions ¢(r) of the radius 7, in the sense of 


the equations 
12.6] 


(77D a(t) balr)dr == O44: 


0 


The functions of the form ¢,(r)- Y, then constitute a (2/ + 1)- 
dimensional sub-space ¥,, which is invariant under rotations 
and in which © induces the representation Q,. Different R,, 
are mutually unitary-orthogonal. Each 2, then appears in 
& infinitely often, its various occurrences being distinguished by 
the ‘‘radial quantum number” ». Consider the analysis of 
single electron spectra given in Chap. II, § 5, in the light of these 
mathematical developments. We then see that the azimuthal 
quantum number / is of purely group-theoretic significance, 
whereas the radial quantum number ~» refers to the dynamical 
situation, for the manner in which the orthogonal system ¢,,(7) 
is to be chosen is determined by the dynamical differential 
equation. 

The proper rotations of 3-dimensional Euclidean space about 
the origin of Cartesian co-ordinates x, y, 2, i.e. the real orthog- 
onal transformations with determinant -++ 1, are most easily 
represented by a stereographic projection of the unit sphere 
about the origin on to the equatorial plane z = 0, the south pole 
of the sphere being the centre of projection. If the point 
(x’, y’, 0) be the image on the plane of the point (x, y, 2) on the 
sphere and we write ¢ = x’ + 7’, the formule for the projection 


are 
ee! 2 _1-¢ 
i ae Es a clea as eo 


But it is preferable to introduce the two homogeneous complex 
co-ordinates £, 7 in place of € by means of the equation C = w/E ; 
the south pole €: 7 = 0:1 1s then included. We then have 
x+wyix—wyw: 2 : l = 
QnE : 2H :€& —yH: £6 + 7. 
Accordingly each unitary transformation 


o:f = af + By, 7! = y€ + 8 


of the co-ordinates €, 7 corresponds to a rotation s of the sphere, 
the points of which are represented by the rays &: of 2-dimen- 
sional unitary space. Since, as is readily seen, any point and 
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tangential direction through it on the sphere can be carried 
over into any other such configuration on the sphere by means 
of such rotations, we obtain in this way all rotations. Since 
we are only concerned with the ratios of the coefficients «, B, 
y, 6, the arbitrary factor of proportionality may be chosen in 
such a way that the determinant of the transformation is 1. 
Nevertheless this normalization is somewhat artificial as the 
correspondence 1s still double-valued, for on multiplying the 
coeficients of the unitary transformation by — 1, 1.e. on going 
over from o to — a, the normalization 1s unaffected. Hence tu 
each element oa, (7.4), of the wnanodular unitary group u corre- 
sponds a rotation s:o->s under which the co-ordinates 
x-+ 1y, x — ry, 2 transform like 


anf, 289, §F — ni, (8.8) 


Or 
ew kt bi, yr Unb &i), 2~ kai (89) 


(The symbol ~, which we occasionally employ, means that the 
expression on the left transforms like the one on the right.) 
We obtain in this way all rotations, each one exactly twice. 
The rotations about the s-axis are obtained from the ‘ principal 


transformations ”’ 


,_ | 
E aoe ef, ae eg! 


of u. In fact, on setting € = e = e(w) the angle of rotation 
about the z-axis is ¢ = — 2w. In virtue of the correspondence 
o —» s the rotations in 3-dimensions constitute a representation 
of the group u; and, conversely, the association s—o is a 
representation of the group D == 0, of 3-dimensional rotations 
by u, although this representation its double-valued. In virtue 
of this correspondence s + o any representation U(o) of u yields 
a representation of dD, (‘‘ I’ process,” § 5); ©, may thus be thought 
of as a representation of D3, in which case we write it D,, where 
= 52. The (‘‘even’’) D, with integral j are single-valued, 
those with half-integral (i.c. half an odd integer) 7 are double- 
valued. On restricting the group D, to the sub-group 0d, of 
rotations about the z-axis D,; is reduced into the 27+ 1 one- 
dimensional representations ®™) (m= j7, ; — 1, °° :+,— 9). To 
show this we first note that the substratum of our representation 
®, consists of the monomials (7.2) 


Eink 
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where m runs through the values j, 7 —1, ---, —jy. The 


transformation induced on these variables by a rotation ¢ 
about the g-axis is accordingly 


x(m) —-> e(— mo) - x(m). 


The representation o > s of wu is itself contained among the 
representations , of u constructed above; it is, in fact, ®,. 
To show this we note that if (&, 7), (&, 7’) be subjected to oe 
same transformation o of u, then the determinant En’ — nf’, 
well as €€ + ni, is invariant. Consequently (£, 7) transform co- 
grediently to (’, — €’), or as (y, — €); hence 


xt+y~wn, x—typyrw— & a~r &y. (8.10) 


The representations , with integral 7 are identical with those 
obtained above as the representations induced on surface har- 
monics of order j, for each polynemial in x, y, 2 of degrce 7 1s, 
in virtue of (8.10), equivalent to a form of order 27 in &, ». 

If we wish to augment u = Uy, in a manner paralleling the 
augmentation of ) = Dd, by the improper rotation 72 (reflection 
in the origin) we must consider it as an abstract group rather 
than a group of linear transformations in two variables. Denote 
the element corresponding to z by e¢ and the elements of the 
original u by o as before. We define the augmented wu’ as the 
totality of elements of the types o and to; ¢ must naturally 
obey the multiplication laws 


c= co, w= hb 


@* and © are then those representations of uw’ which coincide 
with ©, for elements of the restricted group u and which as- 
sociate with the element c the unit matrix + 1 and its negative 
— 1, respectively. The sign + 1s again called the signature. 
The representation ©> associates the augmented rotation group 
dD’, with w. 


(c) The Lorentz Group 


Let the 3-dimensional Euclidean space be referred to homo- 
geneous projective co-ordinates x, (a = 0, 1, 2, 3) defined by 


x x x 
Pas | a ye eS 


The equation of the unit sphere is then 
—xa+xit + xk = 0 (8.11) 
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and the formule for the stereographic projection considered 
above become 


%q = €E + 7M, x, = €n + a€ 
1 


2 > ies ap 8.12 
v_ = ($4 — 7), %a = 6 — iy | ate 
On subjecting €, 7 to an arbitrary linear transformation o the 
x, undergo a corresponding real linear transformation s which 
leaves the equation (8.11) invariant. If the absolute value of 


the determinant of o 1s 1, we can readily show that the form 
— xe txt ht x? (8.13) 


is itself invariant under the corresponding s, and that the deter- 
minant of sis + 1. 

We now consider %) == cl, X,, %2, %3 as the co-ordinates of 
space-time; (8.11) is then the equation of the light-cone, the 
gencrators of which are the possible paths for a beam of light. 
In the restricted theory of relativity normal co-ordinate systems 
for space-time are connected with each other by arbitrary 
Lorentz transformations, 1.e. by any real linear transformation 
which leaves the form (8.13) invariant and which does not 
interchange past and future. Lorentz transformations con- 
stitute a group, the ‘‘ complete Lorentz group,” and this group 
describes the homogeneity of the 4-dimensional world. This 
group consists of ‘‘ positive’’ and “ negative’’ transformations, 
i.e. transformations with determinants + 1 and — 1, respectively. 
The first constitute the “‘ restricted Lorentz group,” from which 
the complete group is obtained by introducing in addition the 
spatial reflection 


Ky > Xo, Xe—> —*X, (a = 1, 2, 3). (8.14) 


Under the restricted group right and left, as well as past and 
future, are fundamentally different. Since the expression for 
X» in (8.12) is positive definite, we may state the result obtained 
above in the form: avy linear transformation of &, n, with deter- 
minant of absolute value 1, induces a positive Lorentz transforma- 
tion sin the x,. ‘Transformations o which differ only by a factor 
e'4 of absolute value 1 give rise tothe same s. The correspondence 
o ~> s is naturally a representation. 

The question of whether every positive Lorentz transformation 
s can be obtained in this way arises immediately. That this 
is in fact the case can be seen from general continuity con- 
siderations, for the positive Lorentz transformations constitute 
a single connected continuum. But it 1s also easily proved by 
elementary methods. Since we have seen in (b) above that the 


9? 
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rotations of space s are obtained from the unitary transforma- 
tions a, we need only to examine the Lorentz transformation 


1 
(Xp + %3) > a*(Xq = x3), (Xo ri Xs) —> ~3\%o — %3), 
Ny > Hy, Xa > Xo. 


affecting the time axis, where a is a real non-vanishing constant. 
But this transformation is obtained from the unimodular oc: 


1 
E—> a€, ee 


Returning to the general case, the correspondence s—>o 1s a 
2-dimensional representation of the restricted Lorentz group. 
But o is determined by s only to within the arbitrary “ gauge 
factor’’ e’4; we may therefore normalize it by the condition 
that the determinant of o shall itself be unity, not merely its 
absolute value. Even so, o remains double-valued, for — o 
satisfies the normalizing condition as well as o. This repre- 
sentation s—o contains the representation of the rotation 
group considered in (b) on allowing s to run through the sub- 
group of spatial rotations contained in the restricted Lorentz 


proup. 

The expressions (8.12) are Hermitian forms with matrices 
c 10 0 1 0 —1 (8,15 
= Hoap ha of PIs of 22 fo —af P 


Hence if x denotes the one-columned matrix with elements &, 7 
equations (8.12) may be written 


2S. oe. (8.16) 


On replacing €, 7 by 4, — & the x, undergo the spatial re- 
flection (8.14). That is one way of including the negative 
Lorentz transformations, But if we require that the corre- 
sponding transformation of €, 7 be linear, we must introduce in 
addition to ry = (€, 7) a second pair r’ = (€’, »’) which undergoes 
the transformation o’ contragredient to @& Then 


(7, — €)~(E', n’) to within the factor d, 

(yn, — &)~(E, 7’) to within the factor d 
where d is the determinant of o. Defining 

SS Oe. Se =O. «toe. 2.3) 


re ao 


) 


the quantities 
wy = US, E 
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undergo the same transformation s as (8.16), provided the 
absolute value of the determinant of o is 1. The same is true 
for any linear combination of the two, e.g. x, + x. Hence the 
quantities 


X= ESTES. (8.17) 


undergo the given positive Lorentz transformation s when &, 7 
are subjected to a certain transformation o and simultaneously 
é’, »’ to the transformation o’ contragredient too. l*urthermore, 
they undergo the transformation (8.14) on interchanging the two 
pairs x, x’, 1.e. on subjecting the four variables to the trans- 
formation 


T:€E>€, non; & >& 7 >. (8.18) 
The expression 
cE" + aj’ 
is variant in virtue of the transformation law of €’, n’ defined 
above. To obtain an expression which is also invariant under 


the interchange (8.18) we must add to the above the expression 
obtained froin it by this interchange : 


(EE ay te (66 a). (8.19) 


It will be found advantageous to denote the column con- 
sisting of the four elements (€, 7; &', n’) by a single letter r. 
Let that linear transformation of these four variables which 
transforms &, 7 in accordance with S, and &', 7’ 1n accordance 
with S*, be denoted simply by S,: (8-17) then becomes 


Le Cree (8.16’) 


We must now ask to what extent the linear transformation o 
of the four variables r 1s determined by the requirement that 
it induce a given (positive or negative) Lorentz transformation 
s of the Hermitian forms x,. It suffices for this purpose to 
inquire what transformations of the x induce the identity on 
the variables x,. The only transformations of this latter kind 
are those which multiply €, 7 with a common factor e'4 of absolute 
value 1 and at the same time €’, 7 with any factor e'?”’ (inde- 
pendent of the first) of absolute value 1. But o can be more 
precisely specified by the requirement that (8.19), 1.e. rT 7, be 
also invariant. The two arbitrary ‘‘ gauge factors’? e'4, et 
must then coincide: the substitution o ts then determined to 
within a factor e'?. 

Our analysis reduces the problem of the representations of 
the Lorentz group to the corresponding problem for the uni- 
modular linear group Cg. 
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§ 9. Character of a Representation 


The trace of a linear correspondence A, i.e. the sum of the 
elements in the principal diagonal of the matrix A, 1s an in- 
variant under transformations of co-ordinates which 1s of 
particular importance. The trace y(s) of the correspondence 
U(s) associated with the element s of the group gq in a repre- 
sentation © of g is called the group characteristic, or, in 
order to avoid assigning yet another meaning to this second 
word, which has already appeared in another important con- 
nection in quantum mechanics, simply the character of the 
representation §). FEquivalent representations have the same 
character ; the name is so chosen because the converse of this 
theorem is true within wide limits. Since U(§) = 1, the value 
of the character y(1) for the unit element is equal to the dimen- 
sionality of the representation. 

It follows from the equations 


U(asa™) = U(a)U(s)U(a7) = U(a)U(s)U~\a) 


that the matrices U(s) and U(asa~') differ only in their orienta- 
tion and consequently have the same trace: 


x(asa~") = x/(s). 


Now s and asa“! are any two conjugate elements of the group @g, 
i.e. they belong to the same class of conjugates in the sense of 
§ 3. We speak of a function f(s) on the group manifold which 
has the same value for all elements s belonging to the same 
class as a class function ; such a function can at most allow us 
to distinguish between different classes, but not between ele- 
ments of the same class. The distinguishing feature of class 
functions can also be expressed in the equation 


f(st) = f(ts). 


The validity of this equation for f = x follows from 

U(st) = U(s)U(t), Ul(ts) == U(t)U(s) 
and the fact that the trace of the matrix AB 1s equal to the 
trace of BA. 


The character y(s) of a unitary representation : U(s~!)=U*(s), 
satisfies the equation 


x(s~*) = x(s). (9.1) 


We shall say that the chdracters of irreducible representations 
are primitive. Any unitary representation $ can be reduced 
into its irreducible components, and the normal co-ordinate 
system in the corresponding sub-spaces can be so chosen that 
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two irreducible constituents are equal if they are equivalent. 
If in this sense 
H=mh+my+--., (9.2) 
where §, ’, - + + are inequivalent irreducible representations and 
m, m' +++ are the numbers of times they occur in , then the 
character X of § 1s expressed 1n terms of the characters x, y’,: °° 
of h, h’, - : - by the equation 
X(s) = mx(s) + mx'(s) +o (9.3) 

From an n-dimensional representation $:s5-—> U(s), with 
the character y(s), and an n’-dimensional ’:s5— U'(s) of 
character x’(s) we can construct the (’)-dimensional repre- 
sentation {) x {’. The elements in the principal diagonal of 
U(s) x U'(s) are obtained by multiplying all elements in the 
principal diagonal of U(s) by those in the principal diagonal 
of U'(s): the character of 9 X Sy’ 1s consequently x(s) y'(s). Again, 
if §) is a representation of the group g, $° a representation of 
the group q’, then the representation ) & 9’ of g x q’ has the 
character € defined by 

(5, 5°) = x(s) x'(s)), (9.4) 
where s runs through the elements of gq and s’ those of q’. 

We need not distinguish between a I-dimensional repre- 
sentation and its character; the character satisfies the simple 
equation (4.2). This holds, for example, for the characters 
e(md), eq. (8.2), of the rotation group Dy. 

By the theorem on the transformation of unitary correspond- 
ences to principal axes, each clement of the group uw == uy ts 
conjugate to a principal element, 1.e. an element of the form 


€ “| 
: 1}. ie) (9.5) 
E 


The characteristic values ¢, I/e are determined to within the 
order in which they appear. Introducing the angle w by the 
equation € := e(w), w characterizes a class of conjugate elements 
of uw; we are only concerned with w mod. 27, and furthermore 
the class —w coincides with the class w. Since for any re- 
presentation © of u the character x(s) depends only on the class 
of the element s, it suffices to calculate it for elements of the 
form (9.5). It must be a periodic function of the angle w with 
period 27, and it must furthermore be an even function of w ; 
its value for Gy ts 


xpeehtefh%?+-+-fet= 


Se) 
e—e} 


(9.6) 
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The characters of the representations considered in_ the 
other examples of the preceding section are just as readily 
calculated. 


§ 10. Schur’s Lemma and Burnside’s Theorem 


Lemma (10.1).5 Assumption. Let 2 be an irreducible system 
of linear correspondences of an m-dimensional vector space t 
on to itself, and §2 such a system of an m-dimensional vector 
space 8. A linear correspondence A shall satisfy the equation 


ZA = AQ (10.2) 


in the following double sense: for each U of 2 there shall exist 
a V of 2 such that 


UA = AV, (10.3) 


and conversely for each V of 92 there shall exist a U of 2’such 
that this relation 1s fulfilled. 

Assertion. Exther A = 0 or m= un and det A +0; in the 
latter case 2 and 22 are equivalent. 

Proof. We first make use of the assumption that 2’ 1s 
irreducible in connection with equation (10.2) in the first sense. 
Considering the k™ column 


Aye, Gor, * * *, mk 
of A as a vector al*), equation (10.3) asserts that the vector 


Uat*) associated with a) through the correspondence U is 
a linear combination of the vectors a), specifically that 


Ual*) coca Dvny al), V = ||orx! |. 
h 


Consequently the sub-space of t spanned by the n vectors a‘) 
is invariant under 2. But because of the assumption that 2’ 
is irreducible either a) == 0, A = 0, or the a) span the entire 
space t, in which case m of them are linearly independent ; 
this latter 1s possible only if m 2 m. That our conclusion 
contains two possibilities 1s due to the fact that the concept 
of irreducibility contains such an alternative. 

The second part of the assumption can be given a simple 
geometrical interpretation on going over to the transposed 
matrices: J2* is irreducible and for each V* of 92* there exists 
a U* of &* such that 


Ve Ate ATO, 
The reasoning employed in the first part of the theorem allows 


us to conclude: either A* = 0 or m =n. We summarize the 
results thus far obtained in the statement: Either 4A = 0 or 
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m==n; in the latter case the m =» columns at) of 4 are 
linearly independent, 1.e. the determinant of A does not vanish. 
But then U and V are determined uniquely by the relation 
(10.3) and 2 and 22 are equivalent. 

In formulating these results it is desirable to consider the 
case of equivalence separately : 

I. If the two irreducible systems &, 22 are inequivalent, (10.2) 
can only be satisfied by A = 0. 

Il. Uf Lis an irreducible system a correspondence A commutes 
with all correspondences U of the system 2’: 


UA AU (10.4) 


if and only tf Ais a multiple of the unit matrix 1. 

Assertion IT follows from the lemma proved above by 
elementary methods and the fundamental theorem of algebra. 
For by the latter there exists a number « such that 
det (4 — a1) = 0, and since A’ == A — al satisties (10-4) for 
all U if A does, we conclude that since det A’ -= 0 we must 
have “l’:= 0, 

Applied to representations, our results are: 

Fundamental Theorem (10.5). 1. Lf s— U(s), s—> T(s) are 
two inequivalent irreducible representations of a group q, the 
equation 

O(S)eb a2 | (5) 


can be satisfied by no matrix A which ts independent of s, except 
ee 

Il. cf matrix .4 which is independent of s and which satisfies 
the equation 


U(s) 4 = AU(s) 


for all s 1s necessarily a multiple of the unit matrix 1. 

If there exists a matrix .f which satishes U(s)d = AU(s) 
identically in s and which is not merely a multiple of the unit 
matrix 1, the argument employed above supplies us with a 
constructive process for the reduction of the representation 
s-> U(s) with the aid of A. 

We now consider an application of these important results, 
which are fundamental for the entire theory of representations, 
in order to prove a theorem due to Burnside. Let & be a 
multiplicative system, 1c. if U, U’ are two correspondences in 
»' then the product UU" is also a correspondence in 2. This 
concept is somewhat wider than that of a group; we need not 
require that U possess an inverse-—its determinant may be 0. 

Burnside’s Theorem (10.6).6 In an irreducible multiplicative 
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system = of linear correspondences U = ||u,4|| of an n dimensional 
vector space on to itself the components u,, are linearly independent. 
This asserts that the only matrix Z which satisfies the equation 


tr (UL) a dD lei Uik oe 0 
1, k 


for all matrices U of the system is L = 0. Contrary to the 
assertion, we assume there exist non-vanishing matrices satis- 
fying this equation; such matrices we shall call L-miatrices. 
It 1s of course possible that every L-matrix whose first column 


yy, los, Re bat 


vanishes must itself vanish. But in any case we can find a 
definite column index h with the following properties: there 
exist non-vanishing £-matrices whose first hk — 1 columns 
vanish and are such that if the h'® column also vanishes then 
necessarily == 0. We shall call L-matrices whose first h — 1 
columns vanish special L-matrices. They constitute a linear 
family of m <n dimensions; we denote a basis for this family 
by 


LQ), LR eee Lon), 


) 


The h'® column of a-special L-matrix will be written [. 
Since 2' 1s multiplicative the equation 


tr (U'UL) = 0 


is satisfied by each .-matrix, where U, U’ are arbitrary corre- 
spondences of the system 2. With L, UL is also an L-matrix ; 
obviously it is a special L-matrix if Lis. Fach of the matrices 


UL®), UL®, +++ UL) 


is therefore a linear combination of L!, ++ +> L() and each of 
the vectors U{@) +> +--+ Ul!™ is a linear combination of the 
vectors [f') + + + [(), Accordingly the vectors [(), + + + [{() 
span a non-vanishing sub-space which is invariant under all the 
correspondences U, and in consequence of the irreducibility 
assumed above it follows that m =m and the vectors [[), + + >, 
((") span the entire n-dimensional space. The basis L], +--+, L(@ 
of the family of special L-matrices can be chosen in such a way 
that [@), +--+ + [(@) are the fundamental vectors of the space ; 
{@) is then the column (1, 0, 0, ++ -, 0), etc. Since then 


Ut) = uv, 1) +--+ + + 1M (10.7) 
we must also have 


UL) = u,, LO +++ +4 u,,L™). (10.8) 
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We now consider an arbitrary column, say the Rk, of L. 
(This is of course of no interest if k <h, for the first h — 1 
columns vanish.) Suppressing the second index k, we now let 
{ = (l,,- °°, /,) denote the k column of L. Then in accordance 
with (10.8), equation (10.7) holds for the present I, i.e. the &'* 
instead of the h'* column of L. Introducing for the moment the 
matrix 


I... KK 
KY. om 


consisting of the Rk columns of L@), +--+, L(), we may write 
(10.7) as the matrix equation 


UA == AU. 


But it follows from this that A must be a multiple of the unit 
matrix, 1.€. 


Wd, B= Ly OSG 


or, returning to the original notation by adding the column 
index k, 


[\) —= A, ’ Of. 


Here we have, by the foregoing, Ay == + + + == Ay_y = 0, A, = 1. 
The equation 
tr (UL) == 0 
becomes 


DD hae Sv, (eed, © 2, oD), (10.9) 
ko 


ic. all correspondences of the system 2* carry the vector A 
with components (Ay, Ag, * + *, A,) over into the null-vector. 
In consequence of the irreducibility of 2 this vector must there- 
fore vanish, which is in contradiction with the equation A, = 1; 
Burnside’s theorem then follows by reductio ad absurdum.—lIf 
we know that the unit matrix is contained in the system 2, as 
is the case for a representation, we can conclude that A; = 0 by 
taking U in (10.9) as the unit matrix. 

Reducibility requires that on employing an appropriate 
co-ordinate system all matrices U of the system 2 have an 
entire rectangle of vanishing elements and consequently implies 
a system of homogeneous linear relations between the components 
uj, of a very special kind. Burnside’s theorem states that if 
there exists no system of homogeneous linear relations of this 
special kind, then there exists no linear dependence at all. The 
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real reason for this remarkable fact is of course to be found in 
the assumption that 2’ is closed with respect to multiplication. 

If our system 2 consists of an irreducible representation 
which associates with the elements s of the group q the matrix 
U(s), we see from Burnside’s theorem that the components of 
U(s) are linearly independent. The method developed above 
can readily be extended to prove the same for the components 
of two or more inequivalent irreducible representations U(s), 
U'(s), - + +7 From this it follows that in particular there can 
exist no linear dependences between their characters x(s), x'(s), + °° 
Any unitary representation { can be reduced into irreducible 
components; the character of $ is expressed in terms of the 
characters of these irreducible representations by (9.3). Since 
x(s), x’(s) are linearly independent the coefficients m, m’, + > «, 
which give the number of times the irreducible representations 
h, h’, - + - appear in §, are uniquely determined. This con- 
stitutes a new indirect proof of the following result, which has 
already been proved in § 6 in a more general and more elementary 
way: The irreducible representations into which § can be reduced, 
as well as the number of times they occur, are uniquely determined 
by §), no distinction being made between equivalent representations. 
Two unitary representations §), and {, are obviously equivalent 
if every irreducible representation which ts contained in the one 
is contained in the other the same number of times. Hence 
if §, and $, are inequivalent the character of , cannot be the 
same as the character of §, because of the linear independence 
of the primitive characters: a unitary representation 1s uniquely 
determined by its character alone, and its character may be used 
as a unique name for the representation itself. We here go no 
further into these extensions of Burnside’s theorem, which are 
due to Frobenius and I. Schur, as we shall obtain the same results 
by a more profound method in the next section under assump- 
tions which are more restrictive but which are sufficient for 
our purposes. 

We mention only one consequence. §, §’ being representa- 
tions of the groups g, 9’, respectively, then § X $’ is an irreducible 
representation of g x g’. Indeed, there can exist no homo- 
geneous linear relation with constant coefficients ¢j,, . between 
the components u,,(s)ui,(s’) of U(s) x U'(s') except the trivial 
one c=0. For on applying Burnside’s theorem for the 
irreducible system we have 


, 
pa Cik) (K Uy (S’) eae 0, 
t,x 


and on applying it again for )’ we must have ¢ix, « = 9. 
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§ 11. Orthogonality Properties of Group Characters 


If the abstract group g is finite, then any representation 
§:s-—» U(s) 1s equivalent to a unitary one. To show this take 
any positive definite Hermitian form, e.g. the unit form, subject 
it to all transformations U(s) of § and sum over s, We thus 
obtain a positive definite Hermitian form H which 1s invariant 
under each of the transformations U(s). Now choose the co- 
ordinate system in such a way that H becomes the unit form ; 
then U(s), expressed in terms of these co-ordinates, is unitary. 
This same method of summation over the elements of the group 
gives rise to the fundamental orthogonality relations. 

Let $:5-— U(s), $':5— U'(s) be two inequivalent irre- 
ducible representations of the finite group g, the former being 
g-dimensional and the latter g’-dimensional. We write 


U(s) == ||ere(s)|], U"(s) = |eals)|], 
U's) == [lmi(s)| |. 
For a unitary representation §)’ 
u,,(5) = THe 
If A is an arbitrary matrix with g rows and g’ columns then 
obviously the sum 


DSUDAU' Vt) = B, (11.1) 


taken over all elements ¢ of qg, 1s invariant in the sense that 
USsepU '(s) == B. (11.2) 
In fact, the left-hand side of (11.2) becomes, in virtue of the 
fact that s-—» U(s) is a representation of g, 
SUDA (4). 
where 7 == st, s being fixed and ¢ running through all elements 
of the group. We therefore obtain equation (11.2) or 


U(s)B == BU(s). 
In accordance with the fundamental theorem (10.5) it follows 
from this that Bb = 0, Le. 


SD ix (t)Ayatie(t) = 0. 
t k.« 


Writing s in place of t and remembering that the a,, are arbitrary 
numbers, we obtain the g?- g’® equations 


Zuye(s)X(s) = 0, 
or, in dealing with unitary representations, 
DUsx(S) Mx (S) = 0. (11.3) 
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Taking the single irreducible representation s—» U(s) in- 
stead of the two inequivalent representations 9, §’, we find by 
the same argument that the square matrix 

U(s)AU-\(s) = B, 
found from an arbitrary square matrix A, must satisfy the 
U(s)B = BU(s). 


This requires, however, that B be a multiple of the unit matrix 1, 


1.€, 
Da Pa Usn(S)Ape Ug (S) = a Sy. 
8 , 


the number « depends on the matrix A, the dependence being 
of course linear and homogeneous. Taking as A that matrix 
which has as its only non-vanishing element a,, = 1, we obtain 
the equation 


DU jx (S)Ue(S) == Oe Oy: (11.4) 
Now ||7,.(s)|| is the matrix reciprocal to ||1,,.(s)|| : 
Dy Uni (S)Usx(S) os Oxk- 
On taking «=17 in (11.4) and summing over 1= 1, 2,°°°, g 
we find that 
h- Sek = £hck; 

where h is the order of the group g. 

Expressing the sum 3’ in terms of the mean valuc M = : pe 


8 
our results may be written in the form 


i read 


Miu i.(s)a.(s)} = 128 (11.5) 
0 otherwise 
for any irreducible unitary representation ): s > U(s) and 
Mw sx(s)H,.(s)} = 0 (11.6) 


for any two inequivalent irreducible unitary representations 
s—» U(s), s—> U"(s). The components of one or more inequivalent 
trreducible unitary representations constitute a unitary-orthogonal 
set of functions on the group manifold. 

It follows from these fundamental orthogonality relations 
that the components u,;(s), ui.(s), °° * are linearly independent. 
Since the number of linearly independent functions of an argu- 
ment s which assumes but A values cannot be greater than h 
we must have 


ge+gt+: +s SA. 


ORTHOGONALITY OF GROUP CHARACTERS | 159 


On the left-hand side of this equation occur the squares of the 
degrees of any inequivalent irreducible representation of g. 

We obtain the orthogonality properties of the characters 
by writing k=12, « =e in (11.5), (11.6) and summing over 
these indices : 

Any primitive character satisfies the equation 


M{x(s)x(s)} = 1, (11.7) 


and the characters x(s), x‘(s) of any two inequivalent irreducible 
representations satisfy 


Mix'(s)x(s)} = 0. (11.7") 
The primitive characters of inequivalent representations constitute 
a normal orthogonal set of functions. They are consequently 
lincarly independent, and from this follow all the consequences 
discussed in the previous section. In particular, a representation 
of g can be unambiguously described by its character, no dis- 
tinction being made between equivalent representations. The 


number of times m the irreducible x occurs in the representation 
X is, following (9.3), given by 


m = IRX(s)x(s)}, 11.8) 
and we have 
MX (s)X(s)} — ym? -+- m2 + cee 


This last equation offers a simple criterion for the irreducibility 
of a given representation in terms of its character y: tt 1s neces- 
sary and sufficient that the mean value of xX = |x|*—which is in 
any case integral —be unity. 

Since the characters are class functions we are in dealing 
with them concerned with an argument which runs through 
the K different classes of q; there can therefore be no more 
than A linearly independent class functions. Hence a finite 
group can have no more inequivalent irreducible representations 
than classes. 

Whereas the general concept of a representation seemed at 
first to open up limitless possibilities, we now see that all 
representations are constructed from primitive ones and that 
the number of possible primitive representations is confined 
within narrow limits. The further content of the general theory 
of representations can be stated in the theorem that the sets of 
functions, the orthogonality of which we have shown above, are 
complete orthogonal systems. The primitive characters con- 
stitute a complete orthogonal system in the domain of class 
functions, i.e. there exist exactly K inequivalent irreducible 
representations. The components of a complete system of K 
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inequivalent irreducible representations constitute a complete 
orthogonal system for the totality of functions defined on the 
group manifold, or 

a tal ae a ae 
where the sum on the right is extended over such a complete 
system and g, g’, °° * are the dimensionalities of the individual 
irreducible representations. 


§ 12. Extension to Closed Continuous Groups 


The theory developed in the preceding sections cannot be 
extended to arbitrary groups, but it is applicable mutatis 
mutandis to a group whose elements constitute a continuous 
closed manifold of a finite number of dimensions. Just as the 
immediate neighbourhood of a point on a surface constitutes 
a plane, so the immediate neighbourhood of a point pg on an 
r-dimensional continuous manifold constitutes an r-dimensional 
linear manifold and the line elements from py to neighbouring 
points p define an r-dimensional linear vector space. We 
assume that the infinitesimal elements of our group g (i.e. those 
elements in the neighbourhood of the unit element {), or rather 
the infinitesimal vectors leading to them from |, constitute 
such an r-dimensional vector space, the ‘‘ tangential space ’”’ 
to gq at I. The concept of an infinitesimal rotation will be 
familiar to the reader from the kinematics of rigid bodies, as 
well as the fact that these infinitesimal rotations in 3-dimen- 
sional space constitute a 3-dimensional linear family—in 22-dimen- 
sional space an [n(n — 1)/2]-dimensional family. The multiplica- 
tion of two infinitesimal elements of the group is then expressed 
by the addition of the corresponding vectorial line elements in 
the tangential space. 

A parallelepiped which will serve as a volume element in 
the neighbourhood of | 1s defined by 7 linearly independent 
line elements, and its volume 1s given as usual by the absolute 
value of the determinant of the components of these 7 vectors. 
This volume element is, of course, not entirely independent of 
the choice of a co-ordinate system in the tangential space, but 
the transformation to a new co-ordinate system only multiplies 
the volumes of all such elemental volumes in the neighbourhood 
of | by a constant numerical factor. These volumes are there- 
fore determined to within the choice of a unit of measure; more 
than this we can hardly require. 

On extending the theory developed in the preceding section 
to continuous groups integration replaces su nmation, and it is 
therefore necessary to be able to measure volumes on the entire 
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group manifold of g. With the aid of the foregoing volume 
elements in the neighbourhood of [ can be measured and com- 
pared immediately with cach other, and the same is true for 
the volume elements at any other point of the group manifold. 
The only difficulty lies in carrying the unit of volume from the 
point | to any other point @. Examination of the argument 
of § 11 reveals that the measurement of volume must have the 
following invariantive properties: the volume of an arbitrary 
element must be unaltered by a left-translation of the group 
manifold which transforms the general clement ¢ into 7 = at. 
But this requirement just suffices to specify the process uniquely. 
Consider the volume element at a which arises from an elemental 
volume at [ by the left-translation which throws I into a; per 
definitionem the volumes of these two elements shall be the same. 
On carrying the volume element from a to b by means of the 
translation t’ == (ba~!)t the equation t! = b(a™"t) shows that with 
this definition of volume the volumes of the elements so obtained 
at a and 6 are equal. 

We further assume that our continuous group manifold is 
closed~—in the sense, for example, that the surface of a sphere 
is a closed manifold in contrast with a Euclidean plane, which 
is open. This guarantees that we shall be able to integrate 
continuous functions of position on the group manifold over the 
entire manifold. We now choose the unit of volume in such a 
way that the volume of the entire manifold g is 1; the integrals 
are then mean values. Wenaturally require that the components 
of U(s) in a representation s—» U(s) are continuous functions 
of the element s of gq. The laws (11.5), (11.6), (11.7), (11.7) 
and all consequences obtained ‘from them in § 11 are then valtd 
for irreducible representations of the continuous group qg and their 
characters.§ 

The theory would be extraordinarily restricted if the measure 
of volume, which we have introduced in such a way that it ts 
tnvariant under left-translations, were not automatically invariant 
under (1) right-handed translations: s—> s’ = sa and (2) inversion : 
s—>s'=s}, The first of these properties will be established 
by showing that the volume of a volume element at I is unchanged 
on taking it to a by a left-translation and returning it to | by a 
right-translation. Obviously each infinitesimal element 6s of 
the group then undergoes the linear transformation A: 


6s > &’'s = ads: at, 


i.e. the conjugation £, associated with the element @. Such 
linear transformations in the r-dimensional vector-space of the 
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infinitesimal elements of the group constitute a representation 
a~» A of the abstract group g. Since gq 1s closed, each A must be 
‘* absolute-unitmodular,”’ i.e. the determinant of A must have the 
absolute value 1; and this in turn allows us to conclude that 
the definition of transportation of volumes by either left- or 
right-translations leads to the same result. To prove this 
consider the element a and its powers a*, a®,-- +. Since the 
group manifold g is closed, the infinite set a, a?, a®,--- on g 
possesses a point of condensation J, 1.e. an infinite set of ex- 
ponents » can be found such that as runs through this set 
a” converges to b. To the elements a” and 6 correspond the 
conjugations A” and B, respectively, and in virtue of the con- 
tinuity assumed above det (A") converges to det (B) as ” runs 
through the chosen set. Now since det(B) 1s a finite non- 
vanishing number, and since, if the absolute value of the deter- 
minant of A differed from 1, det (A") would tend toward 0 or o, 
we may conclude the truth of the above assertion. This also 
enables us to prove the truth of (2), invariance under inversion. 
For inversion sends the element ds at | into — 6s, and this 
transformation is absolute-unimodular. Now send one of two 
inverse volume elements at I to a by a left-translation and 
the other to a“! by a right-translation ; we thus obtain volume 
elements at @ and a~! which go into each other by the inversion 
s—>s'=s"'. Since both left- and right-translations conserve 
volumes, these two volume elements have the same volume. 


Examples of the Orthogonality Properties 


We have already found the -primitive characters for the 
group of rotations Dd, of a circle into itself: e(md), m=0, +1, 
+2, ++ +, where ¢ is the angle of rotation. They constitute, 


in fact, a unitary-orthogonal set of functions : 
on 


es _ {2a (m == m’) 
J e(md) e(mn'b) 4b = 1) Gan em) 
If there existed further irreducible representations their char- 
acters would necessarily be orthogonal to all of these; but this 
is impossible, for the functions e(md), where m takes on all 
integral values, already constitute a complete orthogonal 
system. We have, however, already shown by a more direct 
method (§ 8), which did not involve Parseval’s equation, that 
the system of primitive characters e(md) was complete. It is 
therefore natural to consider Parseval’s equation as the simplest 
case of the general group-theoretic completeness theorem men- 
tioned in § 11. 
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The character of the representation ©, of the 2-dimensional 
unitary unimodular group U = Uy is given by (9.6). Writing 


e=-e(w), A=e —e l= 2 sina, es AAdw = do, 


we have 


w 


_ fl (f= 8) 
Fxrxedo= | (fae (12.1) 


This leads us to suspect that do is the volume of that portion 
of the group manifold occupied by those elements o of the group 
whose angles of rotation le between w and w-+ dw. [The 
total volume of the group manifold is then 


w = 0 


lL pace 


If this is correct, (12.1) are the orthogonality relations predicted 
by the general theory, and the equation 


da — as A Adw 
2a 


defines the density of the various classes of the group. In the 
last chapter we shall actually carry through the determination 
of volume and verify these results. 

If there were yet another trreducible representation, with 
character xy, then € = A-y would be an odd periodic function 
of w with period 27 which would be orthogonal to all the functions 
Er= A+ xy, i.e. to the functions 


sinw, sin 2w, sin 3w, °°: 


But these latter are already a complete orthogonal set for 
the domain of odd periodic functions, and consequently the 
C, (f= 0, 1, 2, + + +) constitute a complete system of irreducible 
representations of the group u. A direct proof, which 1s inde- 
pendent of Parseval’s equation, is also to be found in Chap. V, 
§ 16—indeed, it is there carried through for u, in an arbitrary 
number 7 of dimensions. 
The Clebsch-Gordan series 


XtXo = Xrvot Xie t+ EN r-9) (12.2) 


for the characters yy is readily verified. If we know on general 
grounds that the character of a representation specifies 1t uniquely, 
this equation can be used as a proof of the reducibility of ©, x ©, 
into irreducible components with characters as on the right. 
Since the characters are much more readily handled than the 
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representations themselves this principle offers a very powerful 
method for obtaining assertions concerning representations. 
Let f 2g and multiply equation (12.2), which is to be verified, 
by A: 
Exo = DS (Ges fe Gof ee 2 eh fg): 
The product of 
&, = eftl —e(ftl) with y, = 9 + eo? 4-+--+-+4+ E79 
is the difference of two sums; the one ts 
eftgt! +. efig-1 fos ee efii-g 


the exponent decreasing by 2 from term to term, and the other 
is obtained from this one by replacing all exponents by their 
negative. Hence the product is in fact 


Y{ewts — e-em) p= fteyeftg—%-++4f—g. 


The representations 67, ©; (f= 0, 1, 2, + + :) constitute a 
complete set of inequivalent irreducible representations of the 
augmented group uz. To establish this we first note that in an 
irreducible representation of u’ the matrix associated with the 
element « must be a multiple of the unit matrix, for it commutes 
with the irreducible system of matrices constituting the repre- 
sentation. Furthermore, w= 1, so this matrix can only be 
+1or—t1. Since the matrix associated with « is a multiple 
of the unit matrix, and since the extension of u to w involves 
the addition of a single element c, the representation must remain 
irreducible on restricting the group w’ to the sub-group u. Hence 
every irreducible representation of u, 1s obtained by supplement- 
ing the irreducible representations of u, by the association 


e>+1 or i> — 1. 


If §, ' run independently through complete systems of 
inequivalent irreducible representations of the two (finite or 
closed continuous) groups g, g’, respectively, then the 5 X ’ 
constitute a complete system of inequivalent irreducible rep- 
resentations for the direct product g x g’. To prove this we 
note that since the primitive characters y(s) of g constitute a 
complete orthogonal system for class functions of the element s 
which runs through g and the primitive characters x’(s’) of g’ 
do the same for g’, the totality of the products y(s) - y‘(s’) con- 
stitute a complete orthogonal system for the class functions of 
the element (s, s’) which runs through the group g x q’. 

The representations ©,, , introduced in § 5 constitute a com- 
plete system of irreducible representations of c,. when f, g run 
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independently through the numbers 0, 1, 2, - +--+; we here 
only mention this fact without going further into it. 


§ 13. The Algebra of a Group 


We return for the present to finite groups. In order to be 
able to express the completeness theorem we associate with 
each function x(s) on the group manifold of the finite group g 
its ‘* Fourier coefficient matrix,” the group matrix. 


A = J'x(s)U(s), (13.1) 
where §9: s— U(s) is a representation of g. The trace of X, 
f= J'x(s)x(s), (13.2) 


is the Fourier coefhcient of x(s) with respect to the character 
x(s) of §. It 1s here desirable to consider the function x(s) as 
a single quantity x in the group domain, cach element s of the 
group is a dimension in ‘‘ group space’? and the number 4(s) 
is the s-component of the quantity x. We may express the 
quantities themselves symbolically in the form 


X= D175) * Ss. (13.3) 


The matrix X is associated with the quantity x in the repre- 
sentation 9: x— X in |. Addition of “ group quantities’ and 
multiplication of them by a number are introduced in the usual 
way: X -- y has the components a(s) -+ y(s) and ax the com- 
ponents a+ x(s), Group quantities consequently behave like 
vectors in an A-dimensional space, where /t is the order of the 
group. The following definition of multiplication of two arbitrary 
group quantities X and y is suggested by (13.3) : 


zo xy Baliye = Eo) s 
8 


where 


2(s) ee NE), (13.4) 


This last equation, in which the sum is to be extended over all 
pairs of elements ¢, t! whose product is s, defines the product Z 
of the quantities x and y. We denote this product by xy and its 
components by xy(s); this is not to be confused with x(s) - y(s), 
the ordinary product of the two numbers x(s), y(s). Addition 
and multiplication of group quantitics parallel addition and 


166 GROUPS AND THEIR REPRESENTATIONS 


multiplication of the group matrices associated with them by 
(13.1). Indeed, the product of 
X= Lxs)U(s), VY = Lys) U(s) 
is given by 
Z= XV = Sxldy(eU(t’) = F2(s)U(s), 
t,t’ 8 
where 2(s) is defined by (13.4) 

The operations to which the group quantities may be sub- 
jected: (1) addition, (2) multiplication with a number, and (3) 
multiplication with one another, satisfy the usual laws of 
ordinary algebra with two important exceptions: multiplication 
1s not commutative and division is not in general possible, i.e. the 
equation ax = b for given a + 0 and b may have no unique 
solution or even no solution at all. But there does exist a 
quantity 1 having the properties of unity: la = al =a for 
every quantity a; its components all vanish with the exception 
of the one associated with s =I, which is 1. A domain of 
quantities as described above is called an algebra,® and the 
“group quantities’ are the elements of the algebra ; care must 
be taken not to confuse these with the elements of the group 
(cf. V, §5). The association x > X in the representation 
satisfies the conditions : 


1. 1 > 1, to the element 1 corresponds the unit matrix 1 ; 
2. 1f x > X, y > Y and @ is a number, then 


x+yoX+ Y, ax>aX, xyoXy. 


A representation § of the group is the same as a realization or 
‘‘ representation ” of the algebra of the group by matrices such 
that these conditions are satisfied. Actually all we have done 
here is this: we have gone over from the matrices U(s) associ- 
ated with the individual elements of the group to the linear 
manifold of matrices for which they constitute a basis. 

What characterizes an element a of the algebra whose com- 
ponents a(s) define a class function? We have in general 


ax(s) = Sa(st)xt), xa(s) = Jalts)x(), 


and a class function satisfies the equation 

a(st) = a(ts). 
Hence such an a 1s characterized by the fact that it commutes 
with all elements x ‘of the algebra: ax = xa. Employing a 


term carried over from group theory to algebra we may say: 
those elements whose components depend only on the class of 
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conjugate group elements to which the argument s belongs constitute 
the central of the algebra. 

We are interested only in unitary representations s-> U(s), 
For such a representation the Hermitian conjugate of (13.1) is 


X= Las) U(s) = Zi(syU(s) = THs )U(s). 


Hence on defining the conjugate x of the element x by (s)=@(s~}), 
Hermitian conjugate matrices are associated with conjugate 
elements in a unitary representation; this characterizes unitary 
representations. An element will be said to be real if it coin- 
cides with its conjugate. We have scen that the character 
x(s) of a unitary representation satisfies this condition 
x(s) = x(s~*). 

Let 5 be a g-dimensional irreducible unitary representation 
of g. C == ||c,,|| being a given g-dimensional matrix, the element 
c of the algebra defined by 


ae 6, als _§ Pile 
cs) = ay Pau (s) tt [CU(s)] 


is such that c> C in §; this is readily verified with the aid of 
the orthogonality relations. Hence in the correspondence x > X 
X runs through all g-dimensional matrices. We denote the 
quantity with components © taal) by e,,. The set H of all 


elements of the form 


Da Oyk Ciks 
i, k 


where the coefficients c,, are arbitrary, is naturally closed with 
respect to the operations of addition and multiplication by a 
number. But the product of two elements in H is again an 
element in H; indeed, if ¢ is in H and x is an arbitrary element 
of the algebra both cx and xe are also in H. We express this 
situation in a terminology paralleling that of the theory of groups: 
H is an invariant sub-algebra of the algebra I of all group quantities. 
To prove these assertions we first note that the definition (13.1), 
together with the condition that s—> U(s) be a representation 
yields the equation 


XU(s7}) = Y'x(t)U(ts~}), 


XU(s) = SU(st-)x(t\, (13,5) 
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Multiplying on the left by C = ||c,,|| and constructing the trace 
we find 
E tr [(CX) O(s)] = Le(st*)x(t) = cx(s), 
t 
whence y = cx isin H: 
cx = a Vik Cik (13.6) 


and the matrix 
I|ysa]] = CX. (13.7) 


In the same way we can show that if c belongs to H then xe 
does also. If 
x—> X = ||x,,|| in © 
we Call 
Os Xik Cik 


the component of x in H. In accordance with (13.6), (13.7) this 
component is the product of x with 


@ = Cy, + Con +6 + Cgg; 
it is ex = x@. & is a real element belonging to the central of the 
group algebra J’, with components 2 -x(s); it is ‘ 7dempotent,”’ 


i.e. It satisfies the equation e¢ =e. In particular, the product 
of two elements 


a= Da ,ex, B= Soyer 


of H with coefficient matrices A, B, is the quantity ab in H 
with the coefficient matrix AB. eis the 1, the ‘“ modulus,” or 
‘ principal unit,”’ of the sub-algebra H since ex = xe = xX when 
x is in H. The algebra H is identical with the algebra of all 
g-dimensional matrices (‘‘ simple matric algebra’). The ‘‘ units "’ 
e,, satisfy the equations 


Cer Orn = Cin, Cir es. = 0 for r Se or S. (13.8) 


The central of the sub-algebra H consists only of the multiples 
of its modulus e. 


An irreducible representation §’:s—> U'(s) = || u/,(s) || of 
dimensionality g’ which is not equivalent to § yields another 
invariant sub-algebra H’ consisting of all elements of the form 


= C". 


c’ 
x 


, ‘oe 
C= 20.6 | 
t,« 
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/ 


The components of ef, are Bei (6), It follows from the 


h 

orthogonality relations existing between inequivalent  repre- 
sentations that c’-> 0 in the representation §. If ¢ is in H, 
then, by applying (13.6) for x = c’, cc’ = y 1s also, but since 
then X =O (13.7) yields y=0; the two sub-algebras are 
independent in the sense that the product of an element in one 
with an element in the other 1s always 0. Hence the ‘‘ units ”’ 
satisfy 

€,,€ = 0: (13.9) 


The modulus 


of H’ satisfies eg’ == @’g = 0 in addition to e’e’ = e’. 

If a(s) is a class function, a belongs to the central of I" and 
if a-—» A in the g-dimensional irreducible representation 
then the matrix A commutes with all matrices X. Hence A 


is a multiple of the unit matrix: A = + By (13.2) we find 
that the trace a of A is * 
a= Sa(s)x(s). 
8 
In this way the entire theory of representations can be 
translated into the language of modern algebra. This leads to 
a greater freedom of operation and is preferable for the expression 


of the completeness theorem. The orthogonality relations 
between u,,(s), uU(s), + + + lead to Bessel’s inequality 


g-tr (NX) 4-- 0° < he Sx(s)a(s), (13.10) 


where X in the sum on the left is the matrix (13.1) associated 
with x(s) in the g-dimensional irreducible representation \) and 
the sum is taken over any set of inequivalent irreducible repre- 
sentations §, +++. This inequality 1s obtained by expressing 
the fact that the mean value of s(s) 3(s) is non-negative (cf. I, § 7), 
where z is that element obtained from x on subtracting from x 
its components in H,- : -: 


Z=K— (Sx Cin + oc) =X (xe + °°). 


Since the characters constitute an orthogonal system we also 
have the Bessel inequality 


EEfrre < b> Sx(s)(s) (13.11) 


* Cf, also Appendix 2 at the end of the book. 
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where & is defined by (13.2). The completeness theorem asserts 
that 1n both cases the equality sign holds when the sum 1s extended 
over a complete system of inequivalent irreducible representations, 
where 1n (13.10) x(s) 1s any function on the group manifold and 
in (13.11) any class function. The second relation 1s a special 


case of the first, since for class functions X = é 1. 


If the abstract group g is a finite continuous group which 
is closed in the sense of § 12, instead of a finite group as above, 
the sums must be replaced by integrals ; the measure of volume 
on the group manifold is introduced as in §12. We then have 
in place of (13.1), (13.4): 

ee {x(s)U(s)ds 


) 


xy(s) = Jx(st)y(t)dt = J x(t)y(t-'s)de. 


The modulus 1 of the algebra must have as components the 
values of a function 1(s) which vanishes everywhere on the 
group manifold except at the point s = 1 and must there be 


so large that J U(s)ds = 1. Such a function does not exist, but 


we can construct functions approximating these conditions 
arbitrarily close. 

The completeness relations assert that any element x of 
the algebra of a finite group g is the sum of its components in 
the totality of sub-algebras associated with a complete system 
of inequivalent irreducible representations. The group algebra 
I is thus reduced to a set of independent simple matric algebras. 
It suffices to prove this theorem for x = 1: 


1=e+e+--'=(ey +++ +e) +°°°, (18.12) 
for on multiplying this by x it follows tor all elements x. These 
assertions cannot be carried over to continuous groups in the 
form here stated; we must hold to the formulation (13.10) 
(with = instead of SS) containing an arbitrary function x(s). 
We go into the proof of these results in Chap. V, where all 
the results of this section will be derived anew and discussed in 
detail from another more profound point of view. 


§ 14. Invariants and Covariants 


We first discuss briefly the classical concept of an invariant. 
Consider, for example, the group ¢ = ¢, of homogeneous linear 
transformations of two variables €, 7 with unit determinant. 


Let 
ag* + 2bén + cr? 
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be an arbitrary quadratic form in the two variables. The 
‘“ discriminant ’’ ac — 6? is an invariant, for the discriminants 
of two forms which are such that either goes into the other on 
transforming £, 7 by some element of ¢ have the same value. 
We may have, instead of one arbitrary quadratic form, one or 
more arbitrary forms f, ¢, - + + of given orders, n, v, °° -. An 
invariant is a rational integral function J of the coefficients of 
these forms which is homogeneous in the coefficients of each of 
the forms f, ¢, +++ and which has the same value on replacing 
these coefficients by the coefficients of the forms f’, ¢’, - + + into 
which f, ¢, « + + are transformed by an arbitrary transformation 
o of c affecting the variables €, ». 

The coefficients a, a,, °° *, @, of an arbitrary form of order 
n in the variables €, 7 undergo a certain linear transformation 
on subjecting the variables to a transformation o of c, and the 
correspondence between o and this transformation constitutes 
a representation of the group ¢. The same is true for the totality 
of monomials 


agay - +: azn rotrnterrstr =r) 
of order 7 in these coefficients. A homogeneous polynomial 
I of order y in the a, is a linear combination of these monomials. 
We thus see that if J is of given degrees 7, p, * ++ in the coefficients 
of the arbitrary forms f, ¢, + + * it 1s a linear combination of 
quantities which constitute the substratum of a definite re- 
presentation of ¢; this representation is known as soon as we 
have given the orders n, v, > + + of the forms f, ¢, «+ + in the 
variables €, 7 and the degrees 7, p, - ++ of the invariant J in the 
arbitrary coefficients of f, ¢,---. Duiscarding the all too special 
formal algebraic assumptions involved in the ‘‘ classical ”’ 
concept of an invariant, and which the theory of invariants has 
from the beginning attempted to outgrow by generalizations in 
various directions, we may express the concept in modern 
eroup-theoretic language as follows : 
Let §: s > U(s) be a given representation of an abstract group 
Q in an n-dimensional representation space KR with variables x; ; 
a linear form in the x, 1s said to be an invariant in the representation 
space R of if it 1s unchanged under all the transformations U(s). 
f I,, Ig, ° * * are invariants in the representation space of §, 
then any linear combination o,/, + o./,+-° °° of them with 
constant coefficients a,, %, ° °° is also an invariant. The most 
important problem arising here is naturally that concerning the 
number m of linearly independent invariants in the given 
representation space. If V1, Voy ° °° Vm constitute such a com- 
plete set of linearly independent invariants, and if we choose as 
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co-ordinates in R these m quantities and m — m further linear 
formS Yiy.1, °° *; ¥_ Such that the two sets together constitute 
a complete system of linearly independent linear forms in §, 
the transformation U/(s) is, in terms of the variables y, 


Vi = Vie aaa Va ye 
Vn+1 =e Um+1 1 (s) Vy = oa eo aS Um+1) n (s) VY ny 


Vn es Uny(S) V1 2 aac! Unn(S) Vn- 


If we are dealing with a unitary representation the y's can be so 
chosen that they define a normal co-ordinate system; §) 1s 
then reduced into m times the 1-dimensional identical repre- 
sentation y’ = y and an (nx — m)-dimensional representation. 
Hence the problem of finding the number of linearly independent 
invariants in the representation space § reduces to finding how 
often the identical representation with the character 1 is con- 
tained in the given §. But by formula (11.8) the solution of 
this problem 1s given by 


m = Mix(s)}, (14.1) 


or: the mean value of the character x of §), which 1s always a 
non-negative integer, gives the number of linearly independent 
invariants in the representation space of §). 

The formula (14.1) answers the principal question arising 
in the linear invariant theory, and we now proceed to an ex- 
tremely brief discussion of the algebraic invariant theory. Let 
®, $, °° + be representations of the same abstract group g in 
the spaces with variables x,, y,, + * *. We consider rational 
integral functions /(x,, y,, ° + +) which are homogeneous in the 
variables x;, homogeneous in the variables y,, etc. If on sub- 
jecting x, y, ° + * to those linear transformations corresponding 
to the same arbitrary group element s in the representations 
®, , + + J remains unchanged, then it is said to be a rational 
integral invariant of the system [W, 9, > + -] of representations. 
If the orders p, q,°** of the function J in the variables %;, yx, ° °° 
are given, the problem reduces to the one discussed above ; 
for the monomials in these variables which are homogeneous 
of order p in the x;, homogeneous of order g in the y,, * + + con- 
stitute the substratum of a representation obtained in a certain 
way from @, §, -: -. But if we consider simultaneously in- 
variants of all possible orders belonging to the system [G, , - °°] 
we are confronted with new problems. The most important of 
these, which is answered in the afhrmative by the so-called 
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fundamental theorem of the theory of invariants is: Do there 
exist a finite number of invariants such that all others can be 
expressed rationally and integrally in terms of them? This 
involves the question of algebraic, rather than linear, dependence 
between the invariants. We only mention this higher branch 
of the theory of invariants, and do not go into it further, as it 
bears no direct relation to quantum mechanics, !° 

In addition to invariants or scalars, covariant linear 
quantities, such as vectors and tensors, play an important 
role in physics. Let q be the group of all linear transformations 
between the normal co-ordinate systems in space or in space- 
time, e.g. the 3-dimensional group of Euclidean rotations or 
the group of Lorentz transformations, and let §:s5s-—> U(s) be 
an n-dimensional representation of g. A covariant quantity of 
kind §) ts an entity having n components a,, Ao, °° *, A, relative 
to any given co-ordinate system for the vartables of the transforma- 
tion group q and which ts such that on going over to a new co- 
ordinate system by means of the transformation s of q the new 
components a, are obtained from the old by the corresponding 
transformation U(s) of %. If % 1s irreducible such a quantity 
is said to be primitive or simple. Physical quantities are generally 
simple. Thus, for example, the entity whose components are 
the electro-magnetic field strengths in the 4-dimensional world 
is described as an “ anti-symmetric tensor of order 2” rather 
than merely as a “ tensor of order 2°’; we shall see in Chap. V, 
§ 4, that it is therefore a simple quantity. The reduction of 
a representation into its irreducible constituents implies the 
reduction of the corresponding kind of quantities into simple 
quantities. It would appear that the only simple quantities 
with which we deal are tensors which are characterized by 
certain symmetry conditions in addition to their order. We 
shall prove this theorem for the complete linear group ¢ and for 
its unitary sub-group u in Chap. V; it asserts that all repre- 
sentations of ¢ (or u) can be obtained by reduction from the 
powers ¢, (c)*, (c)3, -°- and that the irreducible constituents 
of (c)f arc obtained by imposing certain symmetry conditions. 

We must accordingly generalize the problem of the linear 
theory of invariants in the following manner. Consider two 
unitary representations §:0—> 5s, ):o—-S of the abstract 
group g with clements o; let their dimensionalities be n, N 
and let ) be irreducible. We wish to determine all covariant 
quaniities of kind t in the representation space of S. Calling the 
variables in this representation space x,, which undergo the 
transformation S under the influence of o, such a quantity 
I has n components J,, /,,°+ +, J, which are linearly independent 
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linear forms in the variables x; When the x; undergo the trans- 
formation S the n linear forms J, go over into new ones which 
are obtained from the J, (in which the variables x; have been 
transformed in accordance with S) by means of the transforma- 
tion sof h. If there exist two or more covariant quantities 


T=(1,,1,°°°,1,), [= Ce I, ° vo 1a be. 4 
of the kind § in the representation space of §, then any linear 
combination «af + «’/’ +--+ with constant coefficients « is 


again a quantity of the same kind. We ask for the number m 
of linearly independent quantities of thts kind. The answer is 
that m 1s equal to the number of times the irreducible representation 
bh is contained in §. Hence if y, X are the characters of h, §, 
we have 


m = IM{X(s)x(s)}. (14.2) 


In order to prove this statement we choose the co-ordinate 
system x, in the representation space of § in such a way that 
the matrices of § are reduced into their irreducible constituent 
sub-matrices, the m representations ): h! = h” =-:+ = }™ =) 
being separated out first. The remaining constituents h(*), 

- are inequivalent to §. Denote the variables in the corre- 
sponding invariant sub-spaces by 

co ce oa eo 
The matrix S is completely reduced into the sub-matrices 
so =.5,°°+, SM == 5; smtl) +--+ arranged along the principal 
diagonal. Let 


Vy = AyX% + 1 ain FH, | 


Vn = Any %y °° + ayy eel 
be a covariant quantity of the kind . We can write this in the 
form y = Ax in terms of the column x of the N variables x,, 
the column y of the variables y, and the matrix A = ||a,,||. 
The requirement that / be a quantity of kind means that 
when x is replaced by x’ = Sx, y goes over into y’ = sy, or 


sy = ASx, sAx = ASx, sA = AS. (14.3) 


Corresponding to the reduction of x-space into irreducible 

sub-spaces, the matrix A of the correspondence of x-space on 

y-space is reduced into matrices A’, ---, AM; Ami)... 

consisting of the first 2 rows, ---:, the m'® set of n rows, 
-- of A. Equation (14.3) then becomes 


sf’ aus A's, i s Alm) — Ams : sAlm' ) — Alm s(n 1) me ce 
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It follows from the fundamental theorem (10.5) on representa- 
tions that A’, ++ +, A™ are all multiples of the n-dimensional 
unit matrix and that the remaining A(+) -- - are all zero. 
But this is just our assertion that y= (y,, ye, °° *, y,) is a 
linear combination of the m quantities 


of the kind bh. 


§ 15. Remarks on Lie’s Theory of Continuous Groups 
of Transformations 


In § 12 we made use of the concept of infinitesimal elements 
of a group in order to establish a method of measuring volume 
on a continuous group manifold. We here discuss this concept 
in detail for the 3-dimensional group D of rotations in Euclidean 
space.4!. This group serves to describe the mobility of a body 
in Euclidean space, one point O of which ts fixed in space. Each 
possible position of the body can be considered as arising from 
any given initial position by an operation of dD. A material 
substance distributed throughout the space or any portion of 
it moves as a rigid body about O if the position of each of its 
elements at a given moment is associated with its initial position 
by means of a correspondence belonging to bd. This is the 
description of the motion of such a rigid body which compares 
the position in any moment directly with the initial position, 
ignoring the intermediate states which it has assumed in going 
from the one into the other. But it seems more natural to 
consider it in terms of a continuous motion in which the position 
of the body undergoes an infinitesimal rotation from moment 
to moment, so that the motion as a whole is the integration 
of a series of infinitesimal operations of 0. On employing an 
auxiliary variable ¢t in order to avoid the use of infinitesimals 
and thinking of this parameter as time, the velocity field 
dx = &, dy = y, dz = Zz of an infinitesimal rotation 1s defined 
by {cf. I, § 6] 


dx = be —cy, dy=cx— az, dz= ay — bx, (15.1) 


where the constants a, b, c are independent of position (x, y, 2). 
These velocity fields, which obviously constitute a 3-dimensional 
linear manifold, are the infinitesimal elements of 0D; they are 
the ‘“ vectors ’’ which define the linear space tangent to the group 
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manifold at the point which represents the unit element I. 
The continuous motion of a rigid body about O is characterized 
by the fact that at each moment its velocity field belongs to 
the 3-parameter linear family (15.1). We may take as a basis 
of this family the three clements D,, D,, D, obtained by choosing 


G21 b= 06] 0) a=), OS 1) eS 0 aS 0 OS eS 1 


We call these ‘‘ the infinitesimal rotations about the x-, y- and 
g-axes.”’ S. Lie was the first to undertake a systematic study 
of the construction of transformation groups from their in- 
finitesimal elements. In fact, once they are known all the 
substitutions of the continuous group can be generated by 
integration, t.e. by successive application of such infinitesimal 
elements—at least, all those which belong to the same connected 
“sheet? as the identity. (Example: the proper orthogonal 
transformations can be obtained from the infinitesimal ones, 
but not the improper transformations with determinant — 1). 

In general, consider a continuous r-parameter transformation 
eroup &, and let the group manifold be described in terms of 
the parameters s,, 53, * * *, Ss; in the neighbourhood of the unit 
point, at which they vanish. A portion of the group manifold 
is thereby mapped in a one-to-one continuous manner on a 
neighbourhood of the origin in the r-dimensional number space 
of the parameters s. Let the u-dimensional point-field of the 
transformations be described in terms of co-ordinates xj, %9,°'*, Xn 
in the neighbourhood of the point under consideration, and let 
the correspondence x — x’: 


fee Cae re Xn) Sy, ane =) 


be associated with the element (s,, 52, °° *, S,) of the abstract 
group in its realization by the transformation group. The 
infinitesimal transformation x > x + dx obtained by assigning 
the infinitesimal increments ds to the parameters s in the neigh- 
bourhood of s = 01s given by 


op; eats op; 
dx; = (Se) as -+- + (Set) as, (15.2) 
the parentheses indicate that the differential quotients are to 
be computed for s,; = 0, +++, s,= 0. We postulate a material 
substance which fills the point-field and which is capable of 
executing those and only those motions in which the positions 
of its elements at an arbitrary moment ?’ are obtained from their 
positions at time ¢t by a transformation of &. Again its motion 
can be more simply described as the result of successive deforma- 
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tions corresponding to infinitesimal operations (15.2) of our 
group; the velocity field must at any time have the form 


t= (A) o, font (Fo, (15.3) 


Sy 


where o,, * * *, o, are constants independent of position. This 
r-dimensional linear family constitutes the infinitesimal group of 
motions of our substance. It is to be observed that the application 
of these infinitesimal processes to our transformation group 
presupposes that the functions ¢, are differentiable with respect 
to s at the point s=0. In the theory of abstract groups the 
point-field 1s the group manifold itself and we take as a realization 
(left-)translation. In the neighbourhood of the unit element 
s = 0, t= 0 we have, as law of composition, 


[Sg Sp See Se eet ee Te a ey 


The introduction of a measure of volume in § 12 presupposes 
that the functions #, are, for sufhiciently small ¢, differentiable 
with respect to the s at the point s = 0, and that for sufficiently 
small s they are differentiable with respect to ¢ at t = 0. 

The composition of infinitesimal elements of the group is 
expressed by addition of the parameters o introduced by (15.3). 
It might therefore appear as if the infinitesimal clements of an 
r-parameter continuous group need satisfy no condition other 
than that they constitute a linear family. However, that is 
not the case; there are further “integrability conditions’ to 
be satisfied. The example of a sphere which rolls without 
slipping on a horizontal table shows that the possible positions 
of a body whose infinitesimal motions have but three degrees 
of freedom can nevertheless constitute a 5-dimensional manifold. 
The integrability conditions we are seeking, which involve 
second order derivatives, guarantee that this situation does not 
arise. We obtain these conditions on expressing the fact that 
the commutator sts~!t-! of two infinitesimal elements s, ¢ of the 
group also is an element of the group. This commutator con- 
verges to | as $ approaches the unit clement I, whatever ¢ may 
be, and similarly as ¢-» lt for arbitrary s. The commutator of 
the two infinitesimal /inear correspondences A and B: 


ak AX. Ax Bx 


is the infinitesimal correspondence AB — BA; to show this 
we note that the equation 


A(s)B(t) = I's, 1) BC) A(s) 
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leads, on writing 


A(0) = 1, B(0) = 4, (=). ay (=) — B, 


t= 


_ (sf) -1 2 /@r = 
0 aaron aa tes 
to the equation 
C = AB — BA. 


Our main purpose in mentioning these matters 1s to prepare 
the ground for an understanding from general principles of the 
commutation rules satisfied by the three infinitesimal rotations 


D,, Dy, Dz: 


0 0 0 0 0 1 0 —1 O 
0 0 —IiI, 0 0 OF}, Tl O OF. (15.4) 
0 1 0 —l1 0 0 0 0 0 
They are, as 1s readily shown, 
D,D,— D,D,=D, D,D,— D,D,= ™ (15.5) 
D,D, — D,D,= Dy. 


We could, of course, take the unimodular unitary group Us 
in two dimensions as fundamental, instead of the group bd, of 
rotations. We denote the two variables which undergo the 
transformations o of the unitary group by €, 7 asin §8. In 
consequence of the correspondence o -> s, which was established 
there by means of a stereographic projection, the 3-dimensional 
rotation group now appears as a representation of u,. We can 
take as a basis for the 3-parameter linear manifold of infinitesimal 
operators of u, the three particular operators— 


] l l 
vie dg = 947) dy = 578) 
] ] ] 
siSyi =—57, y= 5; (15.6) 
] 1 1 
here, in agreement with (8.15), 
0 1 0 —1 ] 0 
SS ; g=l. = : 
1 0 1 0 0 —l 


They are the infinitesimal transformations of u, corresponding 
to the three infinitesimal transformations D,, D,, D, of Dg in 
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virtue of the correspondence o -> s; that this is in fact the case 
is readily seen from (8.10) or 


ere 
xm — B+ n? poe Nee a): gw 2En. 


Given any representation §:o0-—> U(a) of Us, its infinitesimal 
operators with matrices 


] 
1 (M,, M,, M) 


corresponding to the infinitesimal operators (15.6) in u, 
satisfy the same equations (15.5) as the D,, D,, D:: 


M,M,-- M,M,=iM,,° °° (15.7) 


The matrices M,, M,, M; are of course Hermitian. For reasons 
which will appear in the following chapter we call these the 


components of moment of momentum (or angular momentum) 
of the representation §9, and 


M2 = Me = M2 + Me + M? 


the square of the magnitude of the moment of momentum. _ If 
§, $' are two representations with angular momenta Nt, M’ 
then, in accordance with the general formula IT, (10.4), which 
governs the composition of infinitesimal operators by x -multl- 
plication, the representation ) X §)’ has as moment of momentum 
(MN x 1) + (1 & WM). 

We next calculate the moment of momentum W, of the 
irreducible representation ©, == 9; (y= f/2) of u,. It will be 


. l ] 
found more convenient to employ in place of x50 of S,, the 


bud 


transformations 


] 


€ 


GS GGie ee, 0| | 
15.8 


RO} =— bo: 


SS ae, ded | 
In general 
d(érys) = r&t ys dé + 5 &t i dn, 


and on substituting in this the variables 


x(m) = ern 


rl st (r+ts= 2, y — $= 2m) 
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of the representation space of Dj, we find that the three infinitesi- 
mal transformations of u, defined by (15.6), (15.8) induce in this 
space the transformations 


5 (Se + 1S,) : de(m) = Ves 1) x(m — 1) 

= V(j+ m)(j — m + 1) x(m — I), 
5 (Se— 6S,): dx(m) = V5(FF 1) x(m + 1) 

= V(j — m)(j + m + 1) x(m + I), 


r—s 
2 


Hence 
(M, + iM,)(m, m — 1) = V(j + m)(j — m+ 1), 
(M, — iM,)(m, m + 1) = V(j— m)(7 + m + 1),7 (15.9) 


All other components (m, m’) vanish. M? is a multiple of the 
unit matrix in R;: 
M?* = (7 + 1), 
for it follows from 
(M, + 1M,)(M,—1M,) = M; a M; —(M,M, — M,M,) 
= M+ Mi+M, 
that 
M? = (M, + iM,)(M, —iM,) — M, + M2, 

and from this and (15.9) that 

M?(m, m) = (7 + m)(7 —m + 1) — m+ m? = 77 + 1). 

If on reducing an arbitrary representation § the irreducible 
representation 2, is found to occur exactly g; times, then M? 


has 747 + 1) as a [(27 + 1)g,]-fold characteristic number and 
M, has the characteristic number m with multiplicity 


“8; (7 = |m|, |m| + 1, + + +). 


From this we again see that the multiplicity g; with which 9, 
occurs in the reduction of §) is uniquely determined by . 
These infinitesimal operations can be used to give a relatively 
elementary constructive proof of the fact that the D, are the only 
irreducible representation of U,.?? 


§ 16. Representation by Rotations of Ray Space 


In quantum theory the representations take place in system 
space ; but this is to be considered as a ray rather than a vector 
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space, for a pure state is represented by a ray rather than a 
vector. Two unitary transformations U and eU which differ 
only by a numerical factor ¢ of absolute magnitude 1 are con- 
sequently to be considered as the same, U~eU, for they 
determine the same rotation of the ray field. In a “ ray repre- 
sentation,’’ which associates with each element s of the abstract 
group g a unitary rotation U(s) of the rays of n-dimensional 
representation space, the gauge factor e(s) may be taken 
arbitrarily for each unitary matrix U(s); if g is a continuous 
group we choose it, however, in such a way that U(s) depends 
continuously on s. The condition for a representation 1s now 
only 

U(s)U(t) ~ U(st), (16.1) 
ive: 

U(s)U(t) = &(s, t)U(st), (16.2) 
where 4(s, t) 1s a numerical factor, of modulus 1, depending on 
s and t. If by change of gauge Us) 1s replaced by e(s)U(s), 
d(s, t) is replaced by 

e(st)e3(s)e~4(t)d(s, t). 


A= 2” x(s) U(s), 


In the equation 


defining the connection between the components x(s) of an 
element x of the algebra of the group and the group matnx X 
which represents it, the x(s) are also dependent on the gauge 
and are sent into e(s)x(s) on the change of gauge defined by 
U(s)->e(s)\U(s). In order that the multiplication law for two 
elements x, y shall, as we require, parallel the multiplication of 
the matrices which represent them we must define 


xy(s) et b(t, t') x(t) v(t’) (16.3) 


in terms of the chosen gauge. The condition 
HS 4) == 245) 


for a real element x is only appropriate if the gauge is so chosen 
that U(s~!) is the matrix reciprocal to U(s). The algebra of 
the group is to be adapted in this way to the ray representation 
under consideration, whereas in dealing with ‘‘ vector repre- 
sentations ”’ it is uniquely determined by the law of composition 
of the group alone. 


Examples. 


I. The 1-dimensional representations are now entirely 
uninteresting, for any l-dimensional matrix ~1. But under 
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certain circumstances Abelian groups may possess multi-dimen- 
stonal unitary ray representations, whereas any irreducible 
unitary vector representation of an Abelian group 1s necessarily 
of degree 1. 

We first investigate the simplest example, a finite cyclical 
group (a) of order h, consisting of the elements 


[, a, a? os qh} (ah i I). 


Let the element a correspond to the unitary matrix A in the 
ray representation; then A* = a1 is necessarily a multiple of 
the unit matrix. Since « 1s of modulus 1 we may change the 
gauge in such a way that A goes into A/Wa; then A* = 1 
and the correspondence a*-> A® is a vector representation of 
the cyclical group. Hence by introducing an appropriate 
change of gauge the ray representation can be made into a 
vector representation, 4(s, t) being then 1. 

II. The simplest example of an Abelian group which gives 
rise to multi-dimensional irreducible ray representations must 
consequently be non-cyclic. Consider the group consisting of 
the four elements I, a, 0, c with the multiplication table 


ee ee ee 


bein dvink gakees, 9 
A ray representation ¥% is given by 
jl 0 0 | 0 7 LO 
ca ras i b > pile anaceast 
UW =[p ip F@=lf ge YO=L, oh YO | 
(16.5) 


The normalization ts here chosen 1n such a way that 
U*{a) == U(a)U(a™) = 1 

and similarly for I, 6, c. The algebra defined by (16.3) for this 
representation 1s non-commutative in spite of the Abelian 
nature of the group; it is the algebra of complex quaternions. 
On denoting the elements of this algebra by 

x=xl+Aa+ypb4ve, 
the “units” I, a, Bb, c have the same multiplication table as 
the corresponding matrices U: 

l a b Cc 


Pil oa b c (The product xy occupies 
aja | 1c = ib the intersection of the 
bib —ic | ia row X with the column y.) 
c\|c wb --ia | 
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The ‘‘real’’ quantities are those for which all components 
x, A, w, » are real. Since in the calculus of quaternions I, 7a, 
tb, ic arc taken as the fundamental units, they are those whose 
scalar component « is real and whose vectorial components 
A/i, w/t, v/t are purely imaginary. 

Ill. The group w= uy, of unitary transformations o in two 
dimensions with determinant 1. Consider a_ representation 
go —> U(a) by rotations in n-dimensional ray space. On changing 
the gauge in such a way that U(o) goes into 

U(o): Vdet U(o), (16.6) 
the determinant of the new U(o) is 1. The only possible diffi- 
culty consists in the fact that the 2 root 


e(c) = Vdet U(o) (16.7) 


is multiple-valued. It is ‘‘ locally’ single-valued, i.e. if we 
have chosen a definite one € of the » values for the point 
G == G9, We can uniquely determine the root e(a) in a sufficiently 
small neighbourhood of og in such a way that it depends con- 
tinuously on o and goes over into €, for o = a9. Hence we can 
continue the determination of the root for o = ag in a unique 
manner along a path in the group manifold, starting in ap. 
The only question is whether e(o) returns to its original value 
when we allow o to describe a closed path. This 1s to be answered 
mm the affirmative, since the group manifold of wis simply connected 
in the sense that any closed curve can be drawn together into 
a point by a continuous deformation. For in accordance with 
equation (7.5) the elements of the group are mapped in a one- 
to-one continuous manner on the quadruple («Ap v) of real numbers 
which are subject to the condition 


Ke DP 2 fe vB ex U, 

Hence the group manifold has the same topological properties 
as a 3-dimensional sphere in 4-dimensional space. These con- 
siderations thus show that the 7 root (16.7) is broken up into 
n single-valued continuous functions over the entire group 
manifold. The method of proof here employed, which ts of 
fundamental importance in the whole of mathematics, 1s perhaps 
best known to the reader in the proof of Cauchy's integral 
theorem ; it follows from the fact that the integral of an analytic 
function is locally single-valued, that it is single-valued a the 
large if the region in which we are operating is simply connected. 

The result of our topological considerations showed that 
the formula (16.6) defines » single-valued continuous functions 
U(o). One of them is such that in it U(f) is the unit matrix ; 
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we henceforth denote it alone by U(o). On writing the equation 
U(o) U(r) = &(0, r)U(or) (16.8) 


for 7 = I, and taking into account the fact that U(l) = 1, we 
find 8(s, 1) = 1. On forming the determinant of both sides of 
(16.8) we obtain the equation 


1 = [8(o, 7)]”. 


d(a, 7) is consequently an xn root of unity which depends con- 
tinuously on 7 for fixed o and which reduces to 1 for 7 = 1; 
hence it is identically equal to 1, and (16-8) becomes 


U(a) U(r) = Ulor). 


Consequently the only ray representations of Us are also vector 
representations, and our considerations show that thts theorem ts 
valid for any continuous group whose elements constitute a simply 
connected manifold. On going over to the 3-dimensional rotation 
group 03 by stereographic projection, all ;, even those with 
half-integral 7, are single-valued when considered as ray repre- 
sentations. Any single-valued continuous ray representation of 
03 is reducible into irreducible constituents, and the only irre- 
ducible ray representations are the D; (7 = 0, 1/2, 1, 3/2, ++ -) 
obtained earlicr in the chapter. But Dj, 1s not simply connected ; 
we must resort to a two-sheeted covering surface, similar to 
a Riemannian surface but without cuts or branch points, which 
is simply connected. This accounts for the fact that there 
exist irreducible ray representations of D3 which may be single- 
or double-valued vector representations, but there cannot exist 
multiple-valued representations of higher degree. 

I have been able to prove the same theorem for the 2-dimen- 
sional rotation group (m = 3).44 This means that there exist 
two closed continuous motions (1.e. motions which lead back 
to the initial state) of a rigid body, which is free to rotate about 
a fixed point 0, such that any other closed motion can be con- 
tinuously deformed into one of the two. One of these may be 
taken as rest, and the other is such that it cannot be continuously 
deformed into rest. 


CHAPTER IV 


APPLICATION OF THE THEORY OF GROUPS 
TO QUANTUM MECHANICS 


A. The Rotation Group 


§1. The Representation Induced in System Space by 
the Rotation Group 


N accordance with ITT, § 8, we can interpret the theory of 

a single electron in a spherically symmetric electrostatic field, 

as developed in IT, § 5, in the following manner. A rotation 
of physical space, 1.e. an orthogonal transformation from the 
Cartesian c)-ordinates xyz into x’y’z’, induces a unitary trans- 
formation U(s):f—> Wi’ defined by 


p(x'yis') == plxyz) (1.1) 
in the system-space Kt of the electron, the vectors of which are 
the wave functions &(xysz) describing the state of the electron. 
The correspondence s —> U(s) is a definite representation ©, of 
infinitely many dimensions, of the rotation group 03. This 
representation & can be reduced into its irreducible constituents 
®,, and it is found that each , with integral / occurs an infinite 
number of times. The total system-space %h 1s correspondingly 
decomposed into mutually orthogonal sub-spaces R(al) ; Rr) 
has 2/-+ 1 dimensions and the rotation group induces the 
representation QD, in it. If we introduce in addition the im- 
proper rotations (0,) D, always appears in & with the signature 
(— 1)'. The oo-dimensional sub-spaces Bs Rn!) associated with 


the various values of / are uniqucly determined, but their further 
decomposition into the summands R(/) is quite arbitrary. In 
particular, this can be done in such a way that the energy of 
the states composing R(2/) has a definite value E(#/). 

We now calculate the operators induced in system-space 
by the infinitesimal rotations of physical space. Denoting the 
increase oh’ (xyz) -— %(xyz) by dys, equation (1.1) becomes 


dip + & ax + May + as) = 0 
ae 
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for the infinitesimal rotation s which sends 
x,y,z into xv =x+tdx, y=ytdy, 2 =2+4+ dz. 


Taking as s the three infinitesimal rotations D,, D,, D, 1n turn 
[II], (15.4)] and writing the corresponding infinitesimal unitary 
operators in the form 


1 
dis = (Ley Ly, L 3), 


we find 
1, dD 
bo (95 - “). (1.2) 


his accordingly the moment of momentum [cf. IJ, (4.9)]. 

On going over from one electron to two, the vectors of system 
space are the functions b(x,y42,; %2Ve%) of the Cartesian co- 
ordinates of both electrons. The unitary transformation 
U:%-—>w’ induced in system-space by the rotation s is now 
defined by the equation 


Wb (x13) ; %9Vo2q) = W(X1V121 5 X2Vo%0), 
/ 


where xjy}2, and x,y,z, are obtained from x,y,2, and XeVo2, 
by the same orthogonal transformation s. This situation can 
be described as follows: The state space Rt? of the system con- 
sisting of two electrons is & xX & and the representation @? 
induced in it is © x &. 

This representation is, as we see, determined by the kine- 
matical constitution of the system alone, and 1s in no way 
influenced by the dynamical relationships; the rule for x- 
multiplication for the induced representation on composition 
of partial systems presupposes only kinematical, not dynamical, 
independence of the partial systems. 

We can, without further trouble, formulate the situation 
discussed above in terms of the general scheme of quantum 
mechanics in a manner which is independent of the particular 
assumptions of Schrédinger’s scalar wave theory. This is all 
the more important since it has all along seemed doubtful 
whether the matter waves could be described in terms of a 
single state function #. We sct up an analogy between the actual 
displacement of the state of the system in time and the virtual 
change produced by an arbitrary rotation of space. The 
transition from time ¢ to time ¢’) changes the (arbitrary) state 
r at time ¢ into a state r’ at time U’, obtained from xy by a unitary 
transformation U corresponding to a displacement of the time 
axis which sends ¢ over into ¢’. The displacements along the 
time axis constitute a one-parameter continuous group which ts 
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isomorphic with the group of transformations U associated 
with them in system-space. The former group is generated 
from the infinitesimal displacement t—> ¢-+ dt, and it therefore 
suffices to give the infinitesimal unitary operator 


associated with it in system-space. We called the Hermitian 
operator H the energy. 

On subjecting the physical system (or the spatial co-ordinate 
system in terms of which it is described) to a virtual rotation s, 
the state Y goes over into another state rz’. Since nothing 
intrinsic to the system is changed thereby and since the state 
space ® is linear and unitary, the transition U(s):r4—> 2’ 
associated with s must also be linear and unitary. As in the 
case of the group of actual displacements in time, this group 
of virtual rotations in space must induce a certain representation 
Yt in the system-space RW; this latter is more properly to be 
considered as a ray, rather than a vector, space. But if we go 
over from tne rotation group to the unimodular unitary group 
U, (or Uy) by stereographic projection (III, § 8) and take this 
latter as fundamental, it is, in accordance with III, § 16, not 
necessary to distinguish between ray and vector representations. 
The group of proper rotations can be generated from its infini- 
tesimal operations, and we may take as a basis for these the 
infinitesimal rotations D,, D,, D, about the x-, y-, and s-axis. 
It then suffices to know the infinitesimal unitary transformations 


ay == -(M,, M,, M,)v 
which they induce in system space. We call the real physical 
quantities of the system which are represented by the Hermitian 
operators M,, M,, M, the x-, y-, g-components of the moment 
of momentum ‘SX. In order to express them in terms of the 
usual units they must, as was also the case with the energy, 
be multiplied by the quantum of action kh. The moment of 
momentum plays the same réle with respect to the virtual rotations 
of space as the energy with respect lo the actual displacements in 
time. 

One argument for the appropriateness of our definition of mo- 
ment of momentum is that in the case of the Schrodinger theory 
it leads to the usual formule of classical mechanics. As a further 
justification we prove the general theorem that the moment of 
momentum so defined is constant in time. We saw in I, 88, 
that the necessary and sufficient condition that the physical 
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quantity represented by the Hermitian operator A be constant 
in time was that A commute with the Hermitian operator H 
induced by the infinitesimal displacement of time. In exactly 
the same way we can show that the commutativity of A with 
M,, M,, M, constitutes the necessary and sufficient condition 
that the quantity represented by A remains unaltered under the 
virtual proper rotations of space, 1.c. that A is a scalar with 
respect to these rotations. Now the energy is a scalar, hence 


HM,—-M,H=0,°-°:.. 


But, on the other hand, these equations assert that M,, M,, M, 
are constant in time. 

The infinitesimal rotations generate only the group of proper 
rotations ; in order to obtain the complete orthogonal group we 
must supplement them with the reflection 7 in the origin, or 
extend the group u, to the group uy by the addition of the ele- 
ment ¢ (III, § 8). ¢ will induce a unitary operator / in system 
space which commutes with all U(s), in particular with the 
moment of momentum St = (M,, M,, M,), and which satisfies 
the equation JJ =1; this shows that / is Hermitian, as well 
as unitary. A quantity A which is unchanged by reflection 
must commute with /; hence, in particular, the energy H 
must commute with /. The physical quantity represented by /, 
which we call the signature, ts constant in time, as 1t commutes 
with H. It has, in common with all quantities arising in group 
theory which are not associated with infinitesimal operators, 
no analogue in classical mechanics. 

We reduce the total system-space into invariant sub-spaces 
with respect to the group of displacements in time; such an 
invariant sub-space is carried over into itself by the generating 


Boo ita a . ] 
infinitesimal operation dy == Ft. Since we are here dealing 


with a one-parameter Abelian group, or with a single operator H, 
this reduction can be carried to the point in which all the con- 
stituent sub-spaces are l-dimensional. The states contained in 
one of these invariant sub-spaces we call quantum states. 

We now proceed in exactly the same manner to reduce the 
representation ¥t induced in system-space by the group of rota- 
tions into its irreducible constituents D;. We make use of the 
fact that these are known to us a priort ; only the number of 
times they appear in 3¢ depends on the particular representation 
M. (Of course, we have not as yet shown that the 9, really 
constitute a complete system of irreducible representations of 
D3, and it may seem risky to apply the process of reduction to 
the oo-dimensional representation Yt. This procedure can, 
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however, be justified on the basis of the fact that D4 is a closed 
group. But in the final formulation of quantum mechanics it 
will not be necessary to base our conclusions on such general 
considerations, as the reduction into 9, will be obtained by 
elementary means.) The entire system-space ® is thus decom- 
posed into sub-spaces ®t, WR. + + + such that §, is of dimension- 
ality 27 + 1 and the representation induced in it by the group 
u, is D;. On adapting the co-ordinate system in system-space 
to this decomposition the variables fall into classes 


x(m) (m == 7, j—1,-- age | 
x'(m) “(mi a Pig df ae dy tee Sey 


under the influence of an arbitrary transformation oa of u,, 


applied to the variables €, 7 the co-ordinates of system-space 
transform in accordance with the law 


x(n) ~ el Coe ey ee 


With the reduction of Sk or Jt is associated the reduction of the 
angular momentum Yt; in the sub-space ; the components 
of Mo are given by III, (15.9), from which it follows that the 
square M? of the moment of momentum has there the fixed 
value 7(7+ 1). (It is evident from general considerations 
that M? must be a multiple of the unit matrix in ;, for it ts 
a scalar and must therefore commute with all the operators of 
the irreducible representation ®,.) If the state of the system 1s 
represented by a vector lying in ®,, the g-component of its 
moment of momentum is capable of assuming the values m = J, 
jy—1,°°++, —7; the z-component naturally only apparently 
occupies a preferred status, due to the fact that the co-ordinates 
in §; were chosen in a manner which differentiated the 2-axcs 
from the others. That M,, M? cana priori assume only discrete 
values m, (7 + 1) is essentially due to the fact that the rotation 
group is closed; since the group of displacements in time is open, 
the analogous result for the energy need not in general hold. 
In this connection we wish to emphasize again that the operator 
H depends on the dynamical relationships existing in the system, 
whereas the representation Jt induced by the group of rotations 
is determined only by the kinematical situation (number of 
elementary particles, etc.). The signature I also assumes a 
definite one of its values +1 in each sub-space R;. For lack 
of a better name we call the states which lie in the sub- 
space &,, which is invariant under the group of rotations, 
‘‘ simple” states of inner quantum number j. We must 
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be prepared to find that 7 may here assume half-integral as well 
as integral values, in contrast with the Schrodinger theory. 

On uniting two kinematically independent systems, with 
system-spaces $t, ®’ in which the rotation group induces the 
representations ¥, Yt’, the total system has as system-space 
R x MR’, in which the representation MN x M’ is induced. In 
particular, the moment of momentum of the total system ts 


(Mx 1) + (4 x M) 


where Jt and Mt are the angular momenta of the two partial 
systems. The theorem that the moment of momentum behaves 
additively with respect to composition is contingent only on the 
assumption that the parts are kinematically independent, 
whereas the corresponding theorem for energy applies only if 
they are dynamically independent, 1.c. in the absence of inter- 
action between the parts. This difference is based on the fact 
that whereas the energy represents that actual change of state 
in the course of time, the moment of momentum represents 
the virtual change associated with a fictitious rotation. We 
reduce #, R’ into the invariant irreducible sub-spaces Ry, Rj. 
respectively, 1.e into the simple states of the two partial systems 
having inner quantum numbers, 7, 7’. The Clebsch-Gordan 
equation (IIT, § 5) 


Dy XK Dy = Djs + Die tet + Dy (1.3) 


then tells us: Jf the two parts are in the simple states with inner 
quantum numbers 7, 7° then the whole has each of the simple states 
with inner quantum number 


J=j4+, G47 -VMees G7 


associated with it, each exactly once. ‘To include the signature 
we must add: Jf the parts have as signatures the values 6, 8’ 
(6 = + 1), the signature of the whole has the value 8 8’. 

Compare the results which we have obtained with the 
corresponding results in classical mechanics. In both the moment 
of momentum is constant in time and the moment of momentum 
of the whole is equal to the sum of the moments of momentum 
of the two parts. Denoting the magnitude of the moment of 
momentum in classical theory by j, we have, in agreement with 


(1.4), 


(1.4) 


j—7| SJ sj+7, 


for the resultant of two vectors of magnitudes j, 7’ is a vector 
whose magnitude J lies within these limits. Quantum mechanics 
deviates from classical mechanics in the following three respects : 
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1. In quantum mechanics the square of the moment of momentum 
is 1(7 + 1), tn classical mechanics it 1s 7? 

2. Here 3 can assume only the discrete values 0, 4, 1, 3, 
there 1t may have any non-negative value ,; ; 

3. Here the J obtained on compounding two partial systems 
can assume only those values between | 7~--7\,) +7 which differ 
from them by an integer, there it can assume any value between these 
limits. 

Already before the rise of the new quantum mechanics a 
semi-cmpirical description of the regularities observed in spectra 
had been given with the aid of a vector model consisting of the 
vectorial moments of momentum of the individual electrons 
and df the atom as a whole; the observations, assisted by the 
older quantum mechanics, had already led to these three modi- 
fications of classical theory.! 

The reader will perhaps have wondered why we consider 
only the virtual rotations of space and not the translations, 
which must also be taken into account in order to arrive at a 
complete description of the homogeneity of space. The reason 
for this is that in studying atoms or ions we treat only the 
electrons as particles, taking the nucleus as a fixed centre of 
force situated in the origin. That this ts at least approximately 
correct 1s due to the fact that the mass of the nucleus 1s many 
times the mass of the electrons. Space is thereby transformed 
from a homogeneous into a centred space; such a procedure 
naturally allows us to consider only atoms or tons, which have 
a single nucleus. Diatomic molecules are accordingly described 
with the aid of the l-parameter group of rotations about the 
axis joining the two nuclei, and not by the full 3-parameter 
group of rotations of space—to this we must add reflection in 
the plane which bisects the axis perpendicularly in case the two 
nuclei are physically equivalent.? If we are dealing with three 
or more fixed nuclei the symmetry either disappears entirely or 
is reduced to at most a finite group of rotations.? 


§ 2. Simple States and Term Analysis. Examples 


To each characteristic value E’ of the energy H there belongs 
a definite sub-space ft’ of R, the sub-space of quantum states 
with energy level £’; it consists of all states y which are trans- 
formed into E’:r by the operator H and is accordingly the 
characteristic space ‘(E’) associated with the characteristic 
value E’ of H. Since the energy is a scalar, the considerations 
applied in the preceding paragraph to the total space ® can also 
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be applied to Rt’: § is invariant under the operators induced 
in system-space by the rotation group and is consequently the 
carrier of a certain representation of this group, which can be 
reduced into its irreducible constituents. If the energy levels 
are of at most finite multiplicity we are faced with the problem 
of reducing only representations of finite degree. Accordingly 
ft is decomposed into the ‘‘ simple spaces’’ i, associated with 
the rotation group in such a way that not only the square of 
the angular momentum and the signature have definite values 
in R,, but also the energy has a sharply defined value E;. This 
energy level /; is necessarily (27 + 1)-fold degenerate; we 
speak of an accidental degeneracy when the energy levels of 
different simple sub-spaces %t,; are equal. /, M,, M*? and H 
are all simultaneously in diagonal form; that this is possible 
is due to the fact that these four operators all commute among 
themselves. Jn this way the reduction into simple states can be 
employed in term analysis: each energy level Ej; possesses an 
inner quantum number 7 which gives the term the natural 
multiplicity 27 + 1. 

On subjecting the atom to a perturbing field which destroys 
its natural spherical symmetry this (27 + 1)-fold term is broken 
up into 27 + 1 terms. Let the perturbation, 1.e. its Hamiltonian 
function W, possess axial symmetry about the z-axis; if &, 
possesses no accidental degeneracy, then in accordance with the 
theory of perturbations the perturbed energy levels are given to 
a first approximation by the portion of the Hermitian operator 
W in which &, intersects itself : 


x(m) —> S'W(m, m’) x(m') (mo = 9,7 — 1, +++, — 9). 


The rotation about the z-axis with meridian angle ¢ transforms 
x(m) into e(— m@) - x(m), and in virtue of the symmetry assumed 
for W this correspondence of i; on itself must also be represented 
by 
e(— mp) + x(m) = SW(m, m’) - e(— mi) x(m’), 
or 
W(m, m’) e[(m — m’)p] = W(m, m’). 

But this means that all elements W(m, m’) except those in the 
main diagonal vanish, whence 

Ey + W(m, m) (2.1) 


are the 27+ 1 perturbed terms. The quantum number m, 
which is capable of assuming the values 7,7—1, °° -+, —4, 
thus serves to label these components. Perhaps the most 
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important axially symmetric perturbation is that due to a 
homogeneous magnetic field in the direction of the z-axis 
(Zeeman effect); because of this m is called the magnetic 
quantum number. ‘The inner quantum number 7 of a term 
can be determined spectroscopically by counting the number of 
terms appearing in the Zeeman effect. Sommerfeld first con- 
cluded, from the spectroscopic data, that 7 as well as m must be 
allowed to assume half-integral values. If we consider the 
Zeeman effect to be described by the analogue of the classical 
formula IJ, (12.5) then 


as ORS _. 19! ) 
= 3; OM) =hoM., o= 9% (2.2) 
and W 1s rigorously in diagonal form : 
Wim, m) == hom. (2.3) 


Our analysis shows that the breaking up of energy levels due 
to an axially symmetric perturbation parallels the reduction of 
an irreducible representation of the rotation group D3; when this 
is restricted to the group D0, of rotations about the z-axis: by 
this D, is reduced into the 27 +- 1 one-dimensional representations 
which we have previously denoted by D™ : 


x(m) —> e(— md) - x(m). 


If two kinematically independent parts, which are in the 
simple states Rt;, Rj, are compounded together, the state of 
the composite system is in the (27 + 1)(27’ + 1)-dimensional 
product space Rj; == WR; x Ry. If the parts have the energies 
E;, FE‘. then the whole has the energy -; + E}., assuming no 
interaction between the parts. Introducing a weak interaction 
between the two partial systems and assuming that there is no 
accidental degeneracy, i.e. assuming that all the remaining 
energy levels of the unperturbed system are different from £,,, 
it suffices, to a first approximation, to consider the section 
<H) of the energy operator H in which §R,, intersects itself ; 
it is an Hermitian correspondence of Wj; on itself. We can 
apply the considerations, which were applied above to the total 
system-space R X MR’, to each of these R,;-: Rj; 1s to be de- 
composed into sub-spaces belonging to numerically distinct 
characteristic values of <H>. The rotation group induces a 
certain representation in cach of these sub-spaces, and this 
can be further decomposed into its irreducible ccnstituents. 
The result is that R; X Rj, is, in accordance with the Clebsch- 
Gordan series, reduced into the simple spaces Ry, J =j +7, 
j+j'—1,-°+, |f—7'|, in such a way that in each of them 
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the energy <H)> has a definite value Ay. Different Ay can only 
‘‘ accidentally '’ have the same numerical value. Consequently 
the term £,; is broken up by the perturbation into terms Ey 
in exactly the same way as the representation D; x @,- is 
reduced into the irreducible representations Dy. But this 1s 
only correct to the approximation characteristic of perturbation 
theory. As we have seen above, an inner quantum number 
J can be rigorously ascribed to a term &; 1n the approximation 
with which we have been dealing here there ts associated with 
it in addition the inner quantum numbers 7, 7’ of the parts, in 
the last analysis of the electrons themselves: the energy level 
E arises from a definite term &;, of the unperturbed system bv 
interaction of the two parts. Such an association 1s rigorously 
possible for ‘‘ simple states,’ but the rules based on it lead only 
indirectly and approximately to an analysis of the terms.‘ 


Examples 


If we take the Schrodinger scalar wave theory to be valid 
for a single electron, then a simple quantum state of the electron 
in the field of the nucleus is characterized by the principal 
quantum number » and the azimuthal quantum number / (we 
here use the word “ azimuthal” instead of ‘“‘inner’’). Such 
a term is (2/ + 1)-fold degenerate, and we assume there is no 
further accidental degeneration. The moment of momentum 
is represented by the operator % taken over from classical 
theory ; the square of its absolute magnitude is /(/ + 1) and 
the signature has the value (— 1)!.__ If f electrons come together 
to form an atom we obtain a term, neglecting interaction between 
the electrons, 


FEe(tyl,) + E(t.) +--+ + + E(u, l,) (2.4) 
of multiplicity (2/,; + 1) + ++ (2l,-+ 1). The quantum numbers 


7 and / refer to the individual electrons. The interaction causes 
a separation which parallels the complete reduction, obtained 
with the aid of the Clebsch-Gordan series, of 


D, x D), ) a2 Diy (2.5) 


into its irreducible constituents Dz, with total azimuthal quantum 
number Each such term ts associated with the quantum 
numbers 

(1), Mylo, + + +, aely; L). (2.6) 


If f 2 3 certain D, appear more than once in (2.5), and we may 
therefore have several (2L + 1)-fold terms associated with the 
same set (2:6); these must then be distinguished from each 
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other by some further index. The square of the total moment 
of momentum is L(L + 1) and the signature (— Os Vasa 2) 
In spectroscopy it is usual to characterize the valucs / = 0, 1, 2, 3, 
4, + + + by the small Latin letters s, p, d, f, - + - and the values 
L =: 0, 1, 2, 3,+ + + by the corresponding capitals S| P, D, F, +++. 

We cannot expect the scalar wave theory to be correct, 
but must be prepared to describe the state of the wave field 
in terms of a quantity yw with several, say a, components 
(b,, Bo, °° *, Wa), Le. by a covariant quantity of a definite kind 
9%. Each component is a function of the spatial co-ordinates 
xyz; the components will depend on the choice of the Cartesian 
co-ordinate system in such a way that on going over to a new 
co-ordinate system by the rotation s the components will undergo 
among themselves that transformation A(s) which corresponds 
to s in the representation %. Again, consider d; replaced by u, 
as the fundamental group. The general component #,(xyz) of 
the ‘ vector ’ & has two indices, the index « running from | to a 
and the index (xyz) running through all the points of space. 
Let ®, be the vector space of functions (xyz) and R, the 
a-dimensional vector space; the state space of a single electron 
is then RK, xX KR, Under the influence of the rotation s which 
sends xyz into a‘v’s" the state Ww goes over into the state yp’ 
defined by the equation 


pa (x'y'2’) == Lay Pa(xy2), | ae] == A(s) 5 


the representation induced in system-space 1s accordingly 
M=-= Ux E The moment of momentum IM of the electron 
consists of two parts : 


M= (Sx H+ 1x &, (2.7) 


the first of which refers to the a-dimensional ** spin space’ Ma, 
the second to the “ translation space’? R,. (1 xX L,), or simply 
ly oO 0 ; 
L,, 1s the operator “(955 5) which acts on each of the 
1\" 02 
a components in the same way; it affects only the index (ryz), 


Pie: . 
leaving the index « unaltered. oSz is the unitary transformation 


corresponding to the infinitesimal rotation about the x-axis in 
the representation %; (S, x 1), or simply S,, consequently 
affects only the index « and leaves (¥yz) unchanged. Only 
the part @ appears in classical mechanics ; we call it the orbital 
moment of momentum, and the remaining part © the spin 
moment of momentum, or simply the spin. Its appearance 
is unavoidable so long as the wave quantity % is not simply a 
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scalar or a set of scalars. Each of the two parts satisfies separ- 
ately the commutation rules IIT, (15.7), but in general only the 
total angular momentum satisfies the law of conservation. If 
the quantity #% is of a simple kind, i.e. if %& is an irreducible 
representation D,, then @ = 2s + 1 and the spin © is equal to 
the moment of momentum J, associated with the representation 
D,. 
Since the Schrédinger theory has proved itself at least 
approximately correct, one should assume that to a first ap- 
proximation each of the components w, satisfies the Schrédinger 
scalar wave equation. So long as we consider this approxima- 
tion, the a components have only the effect of multiplying the 
multiplicity of each energy level by a2. But in reality the correct 
differential equations must contain a term, the ‘spin per- 
turbation,’’ which introduces a coupling between the various 
components ,. The electron can thus be considered in 
abstracto as a composite system, consisting of the electron 
translation with system-space R, and the electron’ spin 
with system-space ¥,; the spin perturbation is the weak inter- 
action between these two. Because of this the method of 
composition can here be applied. Let & = D,. Decompose the 
translation space Rt, into the (2/ + 1)-dimensional sub-spaces 
Ri(nl) ; the corresponding energy term E(nl) with azimuthal 
quantum number | has, on neglecting the spin perturbation, the 
multiplicity a(2/ + 1) and its characteristic space is the space 
R, X Ri(nl) of the same dimensionality. On taking the first 
order spin perturbation into account this term is separated 
into the terms /, with inner quantum number 7 and ‘multiplicity 
(27 + 1) in a manner paralleling the decomposition of the repre- 
sentation D, x D, into its irreducible constituents: 


Dx D,= 2D, j=stis+il—1,:-.-, ll—sl, (2.8) 


with the aid of the Clebsch-Gordan series. Care must be taken 
to differentiate sharply between the azimuthal and inner quantum 
numbers / and 7. The latter is capable of assuming the values 
given in (2.8); whenever / 2 s the number of different terms in 
such a “ multiplet” is 25+ 1. L? 1s approximately equal to 
the constant /(1 + 1), S*? is approximately equal to the constant 
s(s +1), and M? is rigorously constant and exactly equal to 
(7 + 1). We can thus speak of the azimuthal quantum number 
of an actual energy term only to within the approximation 
characteristic of perturbation theory. It is well to set forth 
these considerations beforehand and to approach the spectro- 
scopic data, as we shall in § 4, with them well in mind. 
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§ 3. Selection and Intensity Rules 


We return to the consideration of our system as a whole, 
without resolving it into its individual electrons, and again 
denote the total inner quantum number by j. Let A be any 
physical quantity of the system, and let it be represented by 
the Hermitian form A; we write that portion of this form in 
which §; intersects Rj, in the form 


o a(mm')x(m)x'(m"), (3.1) 
where the indices m, m’ run through the values 
Lf ss a = I, en ie m’ = ie 7 as 1, es eas Ne (3.2) 


If the quantity A 1s a scalar, the operator <1 commutes with the 
operators U(s) induced in system-space by the rotations  s. 
On decomposition into these irreducible sub-spaces Ry, Rj, it 
follows from the fundamental theorem III, (10.5), of the theory 
of representations that the section (3.1) of 1 corresponding to 
the transition R,—> ®, is sero if j’ +7 and a multiple of the 
(27 + 1)-dimensional unit form 

o £(m) x"(m), 

m 
if 7 == 4. 

An analogous situation exists for the group Dd, of rotations 
about the z-axis. With respect to it the total system space 
decomposes into l-dimensional invariant sub-spaces ™ in 
which the rotation with angle @ induces the representations 
Di) : x(m) -> e( — md) x(m). If we only assume that the physical 
quantity A possesses axial symmetry about the s-axis it follows 
that the coefficient a(mm’) is necessarily zero when the magnetic 
quantum numbers m and m’ of the initial and final states are 
different. 

We now consider a vectorial quantity q with the three 
components gz, dy, Jz instead of the scalar quantity A. This 
is of particular importance because such a quantity, le. the 
electric dipole moment q of the atom, determines the interaction 
between the atom and radiation—to that approximation tin 
which the linear dimensions of the atom may be neglected in 
comparison with the wave-length of the emitted light. If the 
degeneracy of the energy level EF; is destroyed by an external 
axially symmetric perturbation, e.g. a homogencous magnetic 
field in the direction of the z-axis, then the spectral line caused 
by the transition MR; —> Rj, from the term £; to &;, is broken 
up into the lines associated with all possible transitions 
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(Ry, m) +> (Rj, m’). On calculating the part of the Hermitian 
form representing the electric dipole moment in which the sub- 
space WR, intersects M;.: 


Lq(mm')x(m)x'(m'), (3.3) 


the ratios of the squares |q(mm’)|? of the absolute values of its 
coefficients determine the relative intensities of these (27 + 1)(27’+1) 
lines. Since q, is axially symmetric about the z-axis g.(mm’) == 
unless m’ = m; we thus have the selection rule 


dz: m—>m (3.4) 


for the g-component of the electric moment. On _ performing 
the rotation with angle @ about the z-axis x(m), dz -t 1Gy, Ge —— 14y 
are multiplied by e(—md@), e(¢), e(—d) respectively. Since 
E(m)x'(m') is therefore multiplied by e[(m — m')¢] we obtain 
the selection rules 


Get igdy: m>m-— 1, qr—qy: m>m+1 (3.4') 
for the x- and y-components of q. Only the transitions 
m—> m—i1, m m+ti (3.5) 


of the magnetic quantum number are allowed; the first and the 
last generate two waves which are circularly polarized in the xy- 
plane in opposite directions, and the remaining transition m —> m 
generates a wave which 1s linearly polarized in the z-direction. 
If the equation (2.3) holds for Zeeman effect, the wave number 
of the component m --» m’ is displaced by an amount o(m —- m’) 
from its unperturbed value. Thus in “ normal Zeeman effect’ 
we obtain instead of (27 + 1)(27’ + 1) components only three, 
whose polarization 1s as described above and whose wave numbers 
are displaced by the amounts 0, +0. That the resolution of 
the two terms E,;, Ej, is almost entirely hidden is due to the 
fact that the factor of proportionality ho in (2.3) has the same 
value for both terms. Fortunately most of the cases actually 
observed show “ anomalous Zeeman effect,” in which the resolu- 
tion of the terms can be seen clearly; in order to explain it 
we must change the expression (2.2) for the perturbation due 
to the magnetic field. But the above selection rule for the 
magnetic quantum number, which has heen obtained from 
fundamental principles of group theory, is valid in all cases. 

The selection rule for the inner quantum number 7 is obtained 
in an analogous manner. The three components qz, qy, qz of q 
suffer the transformation s among themselves when the x(m), 
x'(m') are subjected to the transformations corresponding to 
sin the representations D,, D, respectively. Or, if we wish to 
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express it in terms of u, instead of D3, s is that transformation 
which 1s associated with the element o of u, in the representation 
Q,. This is, of course, mercly an expression of the fact that q 
is a vector. Now, in accordance with the terminology intro- 
duced in III, § 14, (3.3) is a vectorial quantity in the representa- 
tion space of 2D; X Dy, and we are interested in determining 
how many linearly independent quantities of this kind there 
are, Their number is given by the number of times ®, is 
contained in D, X D, or D, X D, as an irreducible constituent. 


But in accordance with (1.3) , occurs in D, X 9D, exactly 
once if 


J=7-—-1 or 7 or j+] 


and otherwise not at all, and we must further exclude the case 
z= O, pos O. We thus obtain the selection rule 


ak et tae a eo (3.6) 


with the proviso that O— 0 does not occur. Since there exists 
but one linearly independent vectorial quantity in the repre- 
sentation space of D, x ®, in the cases in which the selection 
rule is satisfied, the components of q(m, m’) are determined by 
purely group-theoretic considerations to within a constant factor 
of proportionality. 

In order to calculate the vectorial quantity (3.3) forj’ = 7 — 1 
we proceed as follows. Let €, 7; &, 7’ be two arbitrary points 
on the unit sphere which transform cogrediently under 4, 
Ef’ +. im’ is then the fundamental invariant, and the three 
forms which are obtained from 


Loge 4. aoa 

7 (EE + am’) (3.7) 
by multiplication with 

- ag n*, oy (3.8) 


transform in the same way as the (x + 7y)-, (« — ty)-, s-com- 
ponents of a vector, respectively. They are linear in the 
monomials @%* of degree k + 2 = 27 and in the monomials 
é’r'y’s’ of degree k = 27’. Introducing 


ér ; Elly! 
= HL, (m') = ae 
Vris! Vr ls’! 


(Q—=-rt+s=k+2, m=r—s, 4’ =r + s' =k, 
2m' = r' — Ss’) 
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as co-ordinates in the representation spaces of D,, Dy we find 
that the three forms above are of the type (3.3) with 7’ = 7 — 1. 
For example, we obtain for the (x + 1y)-component 


(£6')"2(Fin')* xn Eras g'r-Bp!s — 
2 5 wet oe 1 aed 


(r—2)+8=k 
=— LV (j + m)(j + m — 1)2(m)x'(m — 1). 


In agreement with the selection rule m + m — 1 there occur here 
only those terms for which m'=m— 1. Calculating the 
(x — 1y)- and z-components in the same way, we find for the 
transition 


(Miia Moa exes 
(ae + igylon, m — 1) = — VEE GF mY, 
(Gz — 1qy)(m,m+1)= VG—m)\(j—m—1), (3.9) 


gem, m) = = =V (7 + m)(j — m). 
In order to calculate the components for the transition 7 = 7’ 
we must replace the factors (3.8) by 

2n'&, 2é°7, gt —_ 17 
which also transform like the (x + 1y)-, (x —- 1y)- and 2-com- 
ponents of a vector. Finally, for the transition 7’ = 7 + 1 we 
must replace (3.8) by 7%, — €, &’y’. Since the angular mo- 
mentum Jt is a vector, the formule for the transition j > 7 
must naturally agree with those already obtained for M [III 
(15.9)], and since q is Hermitian the formule for the transition 
j->j +1 must agree with those obtained by taking the 
Hermitian conjugate of the components for the transition 
j>j—}. 


J>jJ =] 
(Gx + 1gy)(m, m — 1) aces Vj Si m (7 a + 4), 
(qe — 1qy)(m,m+1)=VG—mj+m+), — (3.9) 
q.{m,m) = mm. 
Mc Mc Do 


(qe + iqy)(m, m—1)= VG—m+VG- m+ 2), 
(qn -- iqy)(m, m + 1) = — VG 4+ m+ 1G + m + 2), (3.9) 
g.(m, m) = Vij +m + 1)(7 — m + 1). 
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In each of these three sets of formule the right-hand sides are 
determinate only to within a common factor of proportionality 
which is independent of m, but which can be completely deter- 
mined only by integrating the wave equation of the dynamic 
model of the atom, and not by the theory of groups alone. 
The coefficients which do not occur explicitly in the above 
formule are all null. The squares of the absolute values of these 
coefficients yteld the (rational!) intensity ratios of the components 
into which a line 1s split by the perturbation. 

Already before the rise of the new quantum mechanics the 
intensity formule (3.9) for the components of a line emitted 
under the influence of a magnetic field were obtained from the 
observational data under the guidance of the correspondence 
principle.& In the new quantum mechanics they are, as we 
have seen, a consequence of the most general principles, and we 
would find ourselves in serious difficulties if they were incorrect. 
Nevertheless it 1s to be remembered that they can be invalid 
(1) if the spherical symmetry of the system is destroyed by 
external perturbing fields, or (2) if for short wave-lengths the 
interaction Letween matter and radiation 1s no longer determined 
primarily by the electric dipole moment. 

Since the dipole moment is a proper vector, as the components 
Jz, Wy Jz ZO Over into — gy, — gy, — gz on reflection 7 in the 
origin, the representation ®, induced on them by wy has as 
signature — 1. If the signatures of , Rj are 6, 6’, then under 
the influence of the reflection 7 (3.3) 1s multiphed by the factor 
66°. The coefficients q(mnm’) must accordingly all vanish unless 
66° -- — 1: the selection rule for the signature ts 


Sas iD. 


If the individual electrons are governed by the scalar wave 
theory the total azimuthal quantum number L of the atom 
can jump only to 1 — 1, 1 or + 1, while the sum of the ast- 
muthal quantum numbers of the individual electrons l, 4-l,4+-+ ++ +ly 
can change only by an odd integer (Laporte’s rule). In the case 
of a single electron, f= 1, only the transitions 7 >/- | are 
consistent with these rules ; this result has already been obtained 
in IT, § 5, from the theory of spherical harmonics. 

The formula (3.9) allow us to solve a problem which we shall 
here, for the sake of future application, introduce from the 
physical standpoint. A partial system in the simple state ®, 
is compounded with a second in the simple state Kt, to form 
a single system. In 9, = Ry X Mj, Us induces the representa- 
tion D-= D, X Dy; Iet the corresponding moment of mo- 
mentum be 9. On adapting the normal co-ordinate system 
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in Rj; to the complete reduction of D into its irreducible con- 
stituents Dy, WM is broken up into square sub-matrices My of 
length 2/ + 1, arranged along the principal diagonal, corre- 
sponding to the decomposition of ,; into sub-spaces My. But 
the same is not true of the moment of momentum IM, x 1 of 
the first partial system, and we wish to determine the portion 
of this matrix in which Ry intersects itself. That is, in physical 
language, we wish to determine the temporal mean value «<M, 
of the moment of momentum of the first system in the state 
defined by the quantum numbers j, j'; / of the two parts and 
the whole. We assume that the interaction between the two 
parts resolves the energy level F,, into distinct levels Ey on 
applying the theory of perturbations. Since Yt, is a vector we 
know, from the same considerations as we applied to the electric 
dipole moment above, that the portion of it corresponding to 
the transition ] — J must be a multiple of Mt: 


ONG Se Dp SON: (3.10) 


In order to evaluate the proportionality factor « we construct 
the scalar product of the matrices (Mt, x 1) and Wt; since 


M==(M, x +1 x M,) 
these two matrices commute and we have 


(A x M,)? = M2 + (M, «x 12 — 2MM, x 1) 
or 


22MM, x DHi+VY G++ M, (3.11) 


for since in the original co-ordinate system (Mt x 1)? was 
j(7 + 1) times the unit matrix, it remains the same in the new 
co-ordinates. And, on the other hand, IN(M, x 1) is equal 
to x; J/(/ + 1) times the unit matrix in the sub-space iy, as 
follows from (3.10). Hence from (3.11) 


JU +1) =1G+ 0) -7F + FIV EY, 
wet Se oe I Se!) 
w= 34ST _— 


§4. The Spinning Electron, Multiplet Structure and 
Anomalous Zeeman Effect 


We have hitherto ignored the fact that the terms of the 
alkali spectra, characterized by the two quantum numbers n, J, 
are in reality not simple. Each of these terms--with the exe 
ception of the s terms / == 0—actually consists of a fine doublet. 
By § 2 the (z, 7) term should be resolved into 2/ + 1 components 
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in a magnetic field; instead we find that one of the doublet 
terms breaks up into 2/ components and the other into 2/ + 2. 
We should accordingly ascribe to them the inner quantum 


numbers 7 = 1 — ; y= l-- : respectively. 


Our general considerations immediately give us a hint as 
to how this discrepancy is to be explained. The quantity 
describing the wave field is not a scalar, but is instead a covariant 
quantity of the kind ®,, having two comporients (b,, po). This 
is the theory of doublet phenomena as developed by W. Pauli.8 
It seems indeed easy to arrive at this conclusion after the 
preparation of the preceding paragraphs, but historically this 
systematic foundation was developed only after Pauli’s dis- 
covery. It is quite immaterial whether we associate the matrix 
+ 1 or the matrix — 1 with the element ¢ in the representation 
D, of uz. Taking the first of these altcrnatives, the signature 
has the value (— 1)! in the quantum state (dj) ; hence Laporte’s 
rule remains rigorously correct on taking the spin into account. 
We have as further rigorous selection rules those concerning 
the total inner and the total magnetic quantum numbers. In 
the representation 2, the transformation o itself corresponds 
to the element o of uy, and by III, (15.6), the spin moment of 


i. ak . 
momentum is 5°; where © is the vector already defined with 


components 


=|) of =e 


=|) | 


We shall not as yet attempt to find the specific effect of the 
spin perturbation on the wave equation. This was done origin- 
ally by picturing the electron as a small material sphere, the 
rotation of which gave rise to the spin; the additional moment 
of momentum required by spectroscopic observations was first 
introduced in this way by Goudsmit and Uhlenbeck.’ Since 
S, is capable of assuming only the values + 1 it appears as if 
the spin axis can only be quantized along the positive or negative 
Z-axis; we need not go into the false conclusions this assertion 
can lead to on interpreting it literally. The spin perturbation 
must appear in going over from classical to relativistic mechanics. 
The terms of the hydrogen atom, calculated in accordance with 
the scalar non-relativistic wave mechanics, depend only on the 
principal quantum number m, but the theory of relativity intro- 
duces a correction which causes the terms corresponding to the 
various values of / to split apart and form the so-called fine 
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structure. We should therefore expect the same scheme of 
terms in hydrogen as in the alkalies, but observation shows 
that the doublet separation of an / term into two terms with 


Las 
golt 5 18 just such that two terms with the same j, but with 


different /=j + exactly coincide. Hence the spin per- 


turbation in hydrogen agrees quantitatively with the separation 
caused by the relativity correction. 

The alkali doublets show anomalous Zeeman effect. Other 
elements, such as alkaline earth metals, have (in addition to 
triplets) a system of singlet terms, and singlet terms always 
show normal Zeeman effect in a magnetic field. It therefore 
seems probable that the anomalies in Zeeman effect are closely 
connected with the spin. The magnetic separation of an alkali 
term is quite independent of the principal quantum number n ; 
all the terms of a series behave in the same way. A term (J, j) 
splits up into 27 + 1 equi-distant components, characterized by 
the magnetic quantum number m, but their separation is hog 
instead of ho, where g is a rational function of / andj (the ‘‘ Landé 
g-factor’’). The energy value of the component m is therefore 
displaced by an amount 


hog:m (m==j,j7 —1,° ++, —7) (4.1) 


from its unperturbed value. The empirical formula for the factor 
g, which is due to Landeé, is 


243+ 1 
This formula holds for weak magnetic fields, in which the separa- 
tion is of a smaller order of magnitude than the doublet separation. 

ae | 

If l = 0, j = Dy 

This latter fact gives a hint toward the solution of the puzzle : 

If the total moment of momentum consisted only of the spin 

(= 0), its magnetic effect would be twice as great as if it con- 

sisted of & alone. We therefore assume that the magivelic effect 
ae: eee 

of the spin 5 © is twice as great as that of the orbital angular mo- 


we have in particular g = 2. 


mentum %; the perturbation due to an external magnetic field §) 
1s therefore to be taken as 


_ eh __ eh l 
W = 92 © + S) = 977 (%, M+ 9S). (4.3) 
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The spin offers an explanation of why the beam in the Stern- 
Gerlach experiment is separated into two parts. The valence 
electron of the univalent silver atom is, in the normal state, 
. I 
in an s-orbit (J = 0); hence 7 = 5 


and m can assume only the 
] 
values £5. Although the component of the mechanical 


moment of momentum in the direction of the magnetic field 
h 
can have only the values + 5’ the experiment shows that the 


value of the magnetic moment of the atom is a whole Bohr 
magneton, and not the half of one; but we now see that since 
the mechanical moment of momentum consists only of spin 
it should give rise to twice the expected magnetic moment. 
The connection between magnetic moment and mechanical 
moment of momentum is even more apparent in the maguneto- 
mechantwcal effect: the demagnetization of a vertically suspended 
bar of weak iron must result in giving to 1t an angular momentum. 
The ratio between the change in the magnctic moment and the 
moment of momentum was expected to be ~~, but the experl- 
2c 

ment, which was performed only on ferro-magnetic bodies, 
yielded twice this value. The anomalous magnetic behaviour 
of the spin also accounts for this result, if we assume that the 
mechanical moment of momentum in ferro-magnetic substances 
is due entirely to the electron spin. 

Does this hypothesis also explain the general Lande formula 


(4.2)? This is answered by the formula (3,12) obtained toward 


; oo, ea eh ] = 
the end of § 3, in which J, 7’, / must be taken as =, /, 7 in order 


2 
that it apply to the composition of electron spin and electron 
translation. We find that in the state (Jj) the temporal mean 


value of the spin ac is equal to J multiplied by the factor 
lL ¢—l@4+1)) 
ee ee Oe ee 
. aT SFT 


or 
l . 1 
ey ee eae == ~, 4.4 
g—]1 oye for gol (4.4) 


Hence by (4.3) 


(W> = ine -g+ (OM) = hog M,. 
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So long as the magnetic separation is small compared with the 
spin perturbation the Zeeman separation of the term (lj) is 
determined primarily by <W),; (4.4) then leads, in fact, to 
equation (4.2), in agreement with the empirical data. 

If the atom consists of several, say f, electrons, the situation 
then arising can be understood with the aid of the general rule 
of composition. If the electrons are in quantum states with 
inner quantum numbers 7, and energy levels [(7,), (r = 1, 
2,°++, f), then on neglecting the interaction between the electrons 
the total system has a (27, + 1) +++ (27, + 1)-fold energy level 
E(q,y) +- +++ E(yy). Tf this level coincides with none of the 
other levels it is resolved by a small perturbation into terms 
with total inner quantum numbers / in a manner corresponding 
to that in which the product 


D;, x D;, We 8 KR Dj, = > Dy (4.5) 


is reduced into its irreducible constituents Dy (Clebsch-Gordan 
series). Obviously in order that this (77) coupling lead to an 
adequate description the mutual interactions between the 
electrons must be small compared with the spin perturbation, 

The situation usually met is, however, the opposite of that 
contemplated above: the normal term order corresponds to 
the Russell-Saunders or (s/) coupling. Neglecting for the moment 
the interaction between the electrons as well as the spin per- 
turbation, we are led to a 2/(21, + 1) +++ (2/, + 1)-fold energy 
level (2.4) in whose characteristic space the rotation group in- 
duces the representation 


Dy x (D, x D, X+ ++ xX D,). (4.6) 


Due to the interaction between the electron translations the 
second factor 1s reduced in a manner analogous to (4.5); a 
single term with azimuthal quantum number L has now the 
multiplicity 2/(2L + 1). We next reduce 


= DD, (4.7) 
and finally, as the last step, we carry out the reduction 
Dx Dp= LD, (J=lLt+s,L+s—1,+++, |L—sl), (4.8) 


associated with the coupling between the spin and the orbital 
moment of momentum. The terms which result from this 
last reduction form together a multiplet. Each multiplet is 
therefore associated with a definite azimuthal quantum number 
L and a spin quantum number s; the individual members of 
the multiplet are distinguished by the inner quantum number J. 
We call 2s + 1 the multiplicity, although the number of terms 
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in the multiplet is only actually equal to this when L = s, as 
by (4.8) their number is less if L <s. The 2/-dimensional 
representation Dj is even or odd according as f is even or odd. 
The reduction (4.7) into irreducible constituents accordingly 
yields only integral values for s when f is even and only half- 
integral values when f is odd: The term multiplicities alternate 
regularly between even and odd as we run through the atomic table 
in the order of increasing atomic number (H even, He odd, Li 
even, Be odd, etc: ‘‘ alternation law’’). For f= 2 we have, for 
example, 


Dj = Do 4. ®,. 


It is empirically found that the bivalent alkaline earth metals 
have in fact a singlet and a triplet system of terms. But in the 
triplet system the S terms, for which L = 0, are simple; only 
the P, D,+ + +, terms have the actual multiplicity 3. 

Instead of considering all the electrons at once as in (4.6) 
we can build up the atom by successively adding one electron 
after another. On adding a next electron, say the /", to an 
atom or an ion A‘, a multiplet of A’ characterized by azi- 
muthal quantum number ZL and spin s breaks up into all those 
multiplets contained in the representation (O, x D4) x (@, x D)), 
where J, = /is the azimuthal quantum number of the electron 
added. Since 


D, x 2, = Ds 4 +. ‘ee 
D0, 2D, OP Sa, Bae eS Hed, 


this results in multiplets (s*, L*), one for cach of the pairs 


) 


Magi. Meat Pade) ote ed (4.9) 


(‘ branching rule’). Vhe alternation law is again contained in 
the first of the above equations. It 1s to be noted, however, 
that the Pauli exclusion principle for equivalent orbits, which 
will be discussed in part C of this chapter, materially restricts 
the array of multiplets allowed by this rule.® 

Again applying (3.12) to the composition of spin and orbital 
moment of momentum, we find that the 2/ + 1 components 
into which a J term of a multiplet (s, L) is split in a weak magnetic 
field are displaced from the unperturbed positions by the amounts 


hog-m (m=J,J—1,°--°, —/J) (4.10) 
where the separation factor g is given by 


—,, JU+ty)—LL+) +55 +1) 
g=1+ I(T +1) : (4.11) 
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This is exactly the formula which was derived empirically by 
Landé ; we here see the importance of the fact that the square 
of the absolute value of the moment of momentum J (or @ or G) 
is calculated from the quantum number / (or Lors) by J/(J/ + 1), 
etc., instead of /?, etc., as in the older quantum: mechanics. 

When the magnetic field increases to such an extent that the 
magnetic separation becomes comparable w'th the separation 
between the terms of the multiplet we must handle both the 
perturbation to which the multiplet separation is due and the 
magnetic perturbation together. In order to express the small. 
ness of the term in the Hamiltonian function to which this 
former perturbation 1s due, we introduce a factor p which will 
appear in the same way as the factor o in the magnetic term ; 
the case of a weak magnetic field may then be expressed by 
saying that o is small in comparison with p. We can consider 
o and p as variables which increase gradually from 0 to their 
actual values and follow the dependence of the separation on 
their ratio. We therefore write the perturbation term in the 
Hamiltonian function in the form 


W = pW’ + oW”. 

Since the decomposition (4.8) need not for present purposes 
be expressed in terms of its ultimate constituents, the individual 
electrons, we may here denote the azimuthal and inner quantum 
numbers by / and 7. Let the representation spaces of D,, D, 
be t,, RR, with co-ordinates E(m,), x(m,) respectively. Denote 
the moments of momentum M,, Met, of these two representations 
by 8, 2 respectively ; if the magnetic field has as its direction 
the z-axis, then 

W" = h{(L, + 2s,). (4.12) 
The co-ordinate system is again to be so chosen that the rotations 


about the z-axis appear in reduced form; to such a rotation 
of angle ¢ corresponds the transformation 


E(ms) > e(— mah) E(m.),  x(my) > e(— mip} + x(m)) 5 
the range of the quantum numbers m, and m, 1s given by 
mM,=5S,5—1,+++, —~s; m=ll—l,:::, —Ll (4.18) 


The variables of t, x 8, then behave like the (2s + 1)(2/ + 1) 
products 


E(m,) + x(m,) (4.14) 


and are multiplied, under the influence of a rotation ¢ about 
the z-axis, by e(— md), where 


m =m, + mM, 
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We now reduce ), X ®, into its irreducible constituents 9,. 
Let the co-ordinates of the (27 + 1)-dimensional irreducible sub- 
space of t, < &,, in which the representation D, takes place, 
be denoted by 


aj; m) (m=j,j—1,-+-, — 9). 


m is the magnetic quantum number, 1.e. under the influence of 
the rotation ¢@ about the z-axis x(7 ; m) is multiplied by e(— md). 
The co-ordinate transformation which leads to the complete 
reduction of ®, x ®, into its constituents DB; is obviously of 
such a kind that x{7, m) is a linear combination of those of the 
variables (4.14) for which m, + m, has the value m. 

If the unperturbed system possesses no accidental degenera- 
tion the separation is determined by that part of the matrix 
(4.12) in which the sub-space t, x R, of Rf intersects itself. 
We must therefore solve a secular equation G of degree 
(2s + 1)(2/ + 1); but the problem is materially simplified by 
the fact that the perturbation term possesses rotational symmetry 
about the z-axis, as the only non-vanishing elements of the 
matrix W are those for which m > m. The one secular equation 
G is consequently broken up into 2(/ + s) + 1 secular equations 
G,, corresponding to the possible values 


m=Il+s5,l+s—1,-°+,—(l+5) 


of m. The degree of G,, is given by the number of possible 
partitions of m into two summands m, + m, which run through 
the ranges (4.13). In the case of a single electron, f= 1, we 
have only equations of the first and second degrees, and the 
calculation can therefore be carried through completely for this 
case. '0 

The roots of the secular equation G,, are the displacements 
of the energy terms due to the perturbation. Since the trace 
of a matrix is an invariant, the sum of the term displacements 
which are associated with a definite value m of the magnetic 
quantum number (the roots of the secular equation G,,) is equal 
to the sum of the terms in the principal diagonal of this portion 
of W, 1.c. to 


W(m,m,, m,m,). 
(m, +m, = ™) 


It is therefore a homogeneous linear function of p and o (** sum 
rule’’). We obtain the part due to the magnetic field by putting 
p= 0; by (4.12) this is 


oW"'(m, mM, Me m,) = ho(m, + 2m). 
14 
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On the other hand, the formule (4.10), (4.11) determine the 
term displacements in the case in which o is small in comparison 
with p. In consequence of the sum rule these two results must 
agree. / ands being fixed once and for all, we denote the Landé 
g-factor (4.11) by g(7), and we then have 


S(m, + 2m,) = m~ J'g(j). 


The sum on the left is extended over all partitions of m=m,+m, 
for given m, and that on the right over all values of 7 which 
are consistent with the conditions 


j= ml, jm[+1,---; j=l+s,l4+s—1,--:, lls. 


g(j) can in fact be determined from this equation. For m=I-+s 
both sums reduce to a single term; we then have 


[42s = (l+s)+ gl +9). 

For m==1-+s—1 there are two possibilities for (m,, m,) and 
two forj: m,==l,m,=s—1 orm,=l1—l,m,=s;j7=/l+s 
or?/+s5— 1. Consequently we must have 


2+ 4s— 3 = (I +s — fell +s) + ell +s —})}. 


In this way we obtain recursion formule for the successive 
calculation of g(il+s), g+s—1), -+:+. The reader can 
readily verify that the result of the first few steps agrees with 
(4.11). 

It is to be noted that in following the terms from a weak 
to a strong magnetic field they cannot cross each other, con- 
sidered as functions of the monotonic increasing parameter 
o:p; the “singular elements’’ of a unitary group, t.e. those 
elements for which two or more characteristic values coincide, 
constitute a manifold of three, and not simply one, fewer 
dimensions.!! 


B. The Lorentz Group 


§ 5. Relativistically Invariant Equations of Motion of 
an Electron 


We have as yet obtained no specific expression for the spin 
perturbation; that for the magnetic effect due to an external 
field was set up with the aid of the experimental facts. It is 
clear that we can arrive at a satisfactory theory of the electron 
only when we are able to express its fundamental laws of motion 
in a form which ts invariant under Lorentz transformations, as 
required by the restricted theory of relativity. The solution of 
this problem is due to Dirac.12 We saw in III, § 8, how the 
2-dimensional representation ®, of the rotation group, which, 
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following Paz, characterizes the covariant quantity yp == (xb,, Wo) 
describing the wave field, can be extended to the group of 
positive Lorentz transformations. y,, %, play the same réle as 
the variables £, 7 introduced in connection with Q,. 

Following de Broglie we took as the wave equation of a 
particle of mass m in field-free space 


52 52 2 1 *y =e (my _ *), (5.1) 


mit ptt A 2A 


But this equation is not in agreement with the general scheme 
of quantum mechanics, which requires that only first order 
derivatives with respect to the time appear. The formulation 
of a relativistically invariant differential equation satisfying 
this requirement is, as Dirac discovered, made possible by the 
transition from the scalar wave function % to one with two 
components. We seek to derive these dynamical equations 
from a Hamiltonian principle. 
Let 


Xo = CL, Xy =X, Ng = VY, NXg 2B 
constitute a normal co-ordinate system in our 4-dimensional 
space-time. If the quantity w is of the same kind as y, the 


quantities bS,w behave, in accordance with III, (8.16), like the 
four components of a 4-vector; the S, are the matrices defined 
in IIf, (8.15). Hence in particular 


B=0 


3 
z= dys 
Say 
Dy WS, IX z VB 


are the components ds, of an infinitesimal vector; we are here 
dealing with a linear correspondence which is independent of 
the co-ordinate system employed and which sends the vector 
dx over into ds. Its trace 


~ dus 
Sane 5.2 
Sys" (5.2) 
is consequently a scalar and its integral (multiplied by 1/2) 
] vy oy . 5 3 
M= 7 [20S <* -dx (dx = dxydx,dx_dxs), (5.3) 


extended over any finite portion of the world, is a quantity which 
is independent of the co-ordinate system.* 


* The letter M used for the material part of the action is not to be confused 
with the moment of momentum. 


212 APPLICATIONS OF GROUP THEORY 


Although M may not be real, it is practically real in the sense 


that M — M is the integral of a complete divergence. For 
since the S, are Hermitian matrices, 


and M — M is in fact the integral of 


To 5a) Je ) 

1S 
In using M as an action we are not interested in M itself, but 
only in its variations 8M caused by arbitrary infinitesimal 
variations dp of = (%,, ~.) which vanish outside of a given 
finite portion of the world (the integral is then extended over 
the entire world or, what amounts to the same, over this finite 
portion). The circumstances mentioned above guarantee that 
6M is real; on writing it in the form 


8M = [(Sp-w + & + dp)dx 


we find on comparison with . 3) that 


=p, 
“Wa 
We thus arrive at the first order differential operator 


0 
V = Sas (5.4) 
From the invariance of (5.2) it follows that this operator trans- 
forms = (,, #.) into a quantity p’ = (), 4) which trans- 
forms contragrediently to % = (,, J.) under the influence of 
an arbitrary positive Lorentz transformation. If we wish to 


guarantee that M is real, we may replace the original definition 
by 


—~)(v(js % _ »# 7 
M = ap r(9S.5" 5 p) + dx. (5.5) 

In ITI, § 8, we found it necessary to introduce quantities 
ob, we which transform contragrediently to %,, ~, in order to 
be able to extend the restricted Lorentz group to the complete 
group. And just as V applied to # generates a quantity of the 
kind ’, in the same way the “ — ’” operator 


v= as 


Se 
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transforms #' into a quantity of the kind %. V’'V is, as is readily 
verified, the operator 


x2 ( 9? 9? 9? 
- (+ +5). 


ee we : 
x2 ys an) cy 2 


Consequently equation (5.1) for #,, w. can be written in the form 


“Vy + my’ = 0, | 
; : (5.6) 
Vy’ + mop = 0! 


on introducing an auxiliary pair of components %’. From now 
on we denote the column of the four components yx, bo; yy, be 
by w and employ S, as the symbol for the transformations of 
these four components as in the latter part of Chapter III; 
with this understanding the differential equations (5.6) arise 
from an action integral which is composed additively of the 


quantity M, (5.3), and the invariant [cf. III, (8.19)] 
M = mo \ PT ip AX. 


M and M’ are also invariant with respect to interchange of 
right and left, and under the spatial reflection z in the origin. 

In accordance with the general scheme of quantum mechanics 
the differential equations for % should, as already remarked, 
contain only the first derivative of % with respect to time; the 
additional requirement that it be relativistically invariant then 
leads to the conclusion that it can also contain only first de- 
rivatives with respect to the spatial co-ordinates. We have 
here been able to satisfy these requirements without altering 
the actual content of de Broglie’s equation (for the components 
WJ, w,); the equations thus obtained are to be taken as the 
equations for a free particle. This formal transition to first 
order equations will become physically significant only when 
we pass to the derivation of the equations of motion in an electro- 
magnetic field with the aid of the principle of gauge invariance 
developed in IT, § 12. According to it, if — ¢9 is the scalar and 
d1, 2, 6; the vector potential, we must replace 


1 9d e 
by = +54. (5.7) 
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It will be found convenient in the following to introduce the 
quantities /, obtained by multiplying the potentials 4, by the 


factor = Then in 
he 


M = ;|¥- Vib + dx (5.8) 
the operator V is defined by 
1 esl ; 
7V = ES. 55 + fa). (5.9) 


Because of this gauge invariance the quantities M, M’ are 
unchanged on replacing simultaneously 


yw by es and f, by ia (5.10) 


where A is an arbitrary function of position in space-time. Now 
take A to be an infinitesimal function which vanishes outside 
a certain finite portion of the world: then 6M and 6M’ must 
automatically vanish for the variations 


Sb = id- py, if. — 2, (5.11) 


Xo 
The complete expression 


3(M + M') = J((Sp-w + &- Bp) + Ys Bf,jdx 


for the variation automatically tells us that under the assumption 
that the laws of matter (5.6) are satisfied, 1.c. that w = 0, 


8(M + M’) = ) Y's* 8f, « dx. 
Hence we have as a consequence of the laws of matter 


dA os” 
a Ca ie [ame pee, Sanaa: rae | 
| 2s ve ax {a x ax 


ro 


i.e. the continuity equation 


— = 6. (5 12) 
A glance at the explicit expression for M shows that 

s* == PS, pb; (5.13) 
these are the quantities which formed the starting-point for 
the theory of the transformations of as developed in IT], § 8, 
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and we already know that they form the components of a 
4-vector which is independent of the particular space-time 
co-ordinates employed. The time component 


s° == db = (dis + dove) + (Hid, + days) (8.14) 


is the probability density and hence c$ = c(s}, s?, s%\ is what 
may be called the probability current: in order to obtain the 
number of particles which will on the average pass through 
a surface element do in time unit, multiply the total number of 
particles present into the product of the area do and the normal 
component of the vector ¢8. On integrating the equation (5.12) 
over a volume V we find that the increase in the mean number 
of particles in V per unit time is equal to the mean number of 
particles entering V through the surface in unit time. In 
contrast to the provisional scalar theory, the Dirac theory leads 1n 
a most natural way to expressions for the probability density, as 
well as the probability current, which depend on alone. 
On integrating 


{3° dx, dX_ dx, 


over the whole of space we find that the integral 1s independent 
of time—and, in accordance with the statistical interpretation 
of y, is to be so normalized that its value is 1. Consequently, 
in the dynamical law 


1 dip 

->= + Hp=0 
1 at se 

the energy H/h is a Hermitian operator, as should be. We 

shall from now on take h as the unit of action, with corresponding 

units for linear and angular momentum. The result of this 1s 

that the quantity h disappears completely from the laws of 


ae 1) 
quantum mechanics. With the usual abbreviation, p, = oa 
1 3 
-H = fot 3 Sule + fo) + oT. (5.15) 


The influence of the electro-magnetic field on the matter is 
taken care of by (5.9), but, on the other hand, the matter gener- 
ates the electro-magnetic field in accordance with Maxwell’s 
equations. In order to express this explicitly we must add to 


M + M’ the Maxwellian action 


F = 5 | + fit fh) — (fo + fot fadiex (6.18) 


216 APPLICATIONS OF GROUP THEORY 
of the electro-magnetic field, where the 


Wp of 
Soup ae eee 
Xn OX, 


are the field strengths—which are unaffected by the change of 
gauge (5.10). F is obtained from 


: | ca — &)dV dt (5.17) 


2 2 
by multiplication with (=) = = (5.17) is the action in 


Heaviside units, which are best adapted to the electro-magnetic 
field theory. Since we have taken h as the unit of action, the 
total action of our system, consisting of matter plus field, is 


W=MimM+-F (a=" 5.18 
=M+M'+<7F (a=4). ene 
For reasons which will be apparent later the real number a/4z7 
is called the fine structure constant, Whereas the variation 
of the % in the Hamiltonian integral §W - dx yields the equations 
of matter, variation of f, leads to the equations of the electro- 
magnetic field with 


—ers*= —e- PS, (5.19) 


appearing as the 4-vector of charge and current density. The 
only constants occurring in the field equations are the two 
combinations 


of fundamental atomic constants; the first is a reciprocal 
length and the second a pure number. 

Schrédinger. in his fundamental papers on wave mechanics, 
thought he could explain the quantum behaviour of matter 
and radiation ‘classically ’’ by setting up a closed system of 
field equations such as we have obtained above. In particular, 
he held that the charge of the electron was actually ‘‘ smeared ”’ 
over the whole of space with the density — e- 5°. But there can 
be no doubt at the present time that the field equations are not 
to be interpreted in this classical manner; they must rather 
be interpreted in accordance with the statistical view-point 
developed in Chapter II. The expression (5.14) for the density 
then guarantees the atomistic structure of electricity. To show 
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this we first remark that the charge in a volume V is represented 
by — e times the Hermitian form 


if \ | yobdx jt Xy: 
. (V) 
But this is an *‘ idempotent ” form with respect to the “ vector ”’ 
wy; its characteristic values are 1, 0 and the corresponding 
characteristic functions are those quantities % which vanish 
outside or inside VY, respectively. The charge contained in V 
is accordingly capable of assuming only the values — e and 0, 
i.e. according to whether the electron is found in V or not. In 
order to guarantee the atomicity of electricity the electric 
charge density must equal — e times the probability density. 
But if we base our theory on the de Broglie wave equation, 
modified by introducing the clectro-magnetic potentials in 
accordance with the rule (5.7), we find as the expression for the 


charge density one involving the temporal derivative = in 
addition to #; this expression has nothing to do with the prob- 
ability density and is not even an idempotent form. According 
to Dirac this is the most conclusive argument for the stand 
that the differential equations for the motion of an electron in 
an electro-magnetic field must contain only first order derivatives 
with respect to the time.'% Since it is not possible to obtain 
such an equation with a scalar wave function which satisfies at 
the same time the requirement of relativistic invariance, the 
spin appears as a phenomenon necessitated by the theory of 
relativity. 

The theorem of the conservation of electricity (5.12) follows, 
as we have seen, from the equations of matter, but it is at the 
same time a consequence of the electro-magnetic equations. 
The fact that (5.12) is a consequence of both sets of field laws 
means that these sets are not independent, i.e. that there exists 
an identity between them. The true ground for this identity 
is to be found in the gauge invariance, for it is equivalent to 
the assertion that 5W vanishes identically when # and f, are 
subjected to variations of the form (5.11). We have 


BW = [{(Bp-w + BBY) + SL* Sfa}dx, 


where w= 0 are the equations of matter and L* = 0 the 
Maxwellian equations. On substituting the variations from 
(5.11) and integrating the last term in the integral by parts, 
Le “ oL* 
s(po—ap +s = 0. 


OX, 
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Because of the arbitrariness of the gauge the number of inde- 


pendent equations must be one less than the number of unknown 
functions # and f,. 


§ 6. Energy and Momentum. Remarks on the 
Interchange of Past and Future 


I, Energy and Momentum. 
The complete field equations are explicitly 


l 
ES(5 2 + fa) + my TH = 0; | 
“ 6.1) 
| 6 | 
div &+ p= 90, aia adc 


Where © and § are the electric and a field strengths : 


p is the charge density nbs, and the alana S,, °° * of the 
current $ are given by 


s=bS ib. (6.3) 


In addition to the differential law 
op 
a = 4 
me +. div 3 0, (6.4) 


expressing the conservation of electricity, we have a vector con- 
servation law governing energy and momentum. A completely 
satisfactory expression for the tensor representing density and 
flux of energy and momentum 1s only to be obtained along the 
lines employed in the general theory of relativity. Here we 
give only the result for the density of energy — c- #} and mo- 
mentum (é?, é&, 8), and in doing so we separate the material 


from the electro-magnetic part. We have for the part referring 
to matter 


= m4 == 2 {oSo(se- ae ify) ie ts “~ ify) . S| | 
; + mohTH; + (6.5) 
A= = 5, (0 — Sy) + (S~ S), - 


We have here introduced, in addition to S,, the operator S, 
(p = 1, 2, 3) which acts on all four components of #; whereas 
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the former subjects %,, w, to the 2-dimensional transformation 
Sp [III, (8.15)} and yi, f, to — S,, the latter exercises the 
same 2-dimensional transformation S, on both pairs of com- 
ponents. Correspondingly 


Sy = WSzy. 
The density of energy and momentum due to the electro-magnetic 
field is given by the familiar Maxwellian expressions 


— 6 = s{(Ei + +) + (Ar + +)}; (6.6) 


t) = (E,Hg = F3A,), oe 


J 
We find the conservation laws 
3 308 3 Dt 
—_ = 0: J =O -:- 
a=(0 OX “, 0X4 . 0 


as consequences of the field equations. Furthermore, the tensor 
t is symmetric—not identically, but in consequence of the field 
equations; in this sense we have 

R+2=0(p=1,2,3); &=2(p,qg=1,2, 3). (6.8) 


On combining these with (6.7) we obtain the divergence con- 
ditions 


3 O(%_ tf — Xgl) 

a Wa re 0, ) (6.9) 
d(Xo ti + x, 9) = 

2 a) ae — 0, (6.10) 


These results can all be verified directly, but their deeper 
significance can be understood only by going over to the general 
theory of relativity as mentioned above. Just as the theorem 
of the conservation of electricity follows from the gauge in- 
variance of the equations, the theorems for the conservation 
of energy and momentum follow from the circumstance that 
the action integral, formulated as in the general theory of 
relativity, is invariant under arbitrary (infinitesimal) transforma- 
tions of co-ordinates. In this gencral relativistic formulation we 
need further to erect a normal set of co-ordinate axes at each 
point P of space-time, consisting of four mutually perpendicular 
directions at P (‘‘ orthogonal ennuple”’), in order to fix the 
metric at P and to be able to describe the wave quantity # in 
terms of its components; all permissible orthogonal ennuples 
at P are obtainable from each other by local Lorentz transforma- 
tions which leave P invariant. But the rotations of these local 
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ennuples can be performed in the various points P quite inde- 
pendently—the quantities at various points are not bound to 
each other as in the special theory of relativity. The symmetry 
of the energy-momentum tensor can be traced back to the 
invariance with respect to such rotations. One can in fact 
take it as a general rule that every invariance property of the 
kind met in general relativity, involving an arbitrary function, 
gives rise to a differential conservation theorem. In particular, 
gauge invariance is only to be understood from this standpoint. 
It follows from the transformation laws for # that its four com- 
ponents &%, relative to the local ennuple are determined only to 
within a common factor e’* of proportionality, the exponent A 
of which depends arbitrarily on position in space-time; in 
consequence of this it is necessary, in order to obtain a unique 
covariant differential for y, to set up a linear form D’f,d«, which 


is coupled with the gauge factor contained in #% in the manner 
required by the principle of gauge invariance." 

We obtain the integral conservation laws from the differential 
ones by integration. We sct up the integral 


fiodV = J, (dV = dx, dx, dxs) 


over a section x) = const. of space-time and find that it is 
independent of x). — cJo=H is the energy and (J,, Js, Js) 
the linear momentum. ‘The material part is, on a simple in- 
tegration by parts, 


— Jy = fH S So(F m+ Sp) + mol bh 


1 Xp 
~ 19 
Ii= [be 5 Pav, eas 


These are Hermitian forms in the ‘‘ vector” J. They again lead 
o eo oa 
ax,’ dav, 0X3 
(J1, Je, J3) of linear momentum, i.e. to the assumptions with 
which we, following de Broglie and Schrodinger, began. For the 

energy we obtain (on dividing by c) the operator 


1 2 a) : 
7H= 2 S,(5 dX» + fp) + mol, 


without the additive term fy as in (5.15); the differential equa- 
tions of matter are therefore 


Gs Io + fa)¥ + =H = 0. 


1 
us to associate the operators = = ( ~) with the components 
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Moreover, we must not forget that to the part due to matter 
we must yet add that due to the electro-magnetic field. 
The quantities 


M, = \ (x. — xg 8)aV,- - -, (6.11) 


which are by (6.9) also constant, are the components of the 
moment of momentum. We find from (6.5) that the part due 
to matter 1s 
~(] d 0 ] 
m= fall —md) +38 
1 \H; ay “35%, 2 5 Sifudy, 

In agreement with our earlier assumptions we here obtain the 
operator which 1s composed of the sum of the x,-component 


] Py) 0 
(wax — X54 =) of the orbital moment of momentum and the 
0X3 OX 5 


: 1, 
spin moment of momentum = S;. The vector 


2 
a ors 
5 © pea 9 (Si, So, S3) 


is actually the spin, for in accordance with the law of trans- 
formation of both # pairs (f,, $.), (f,, #3) of components suffer 
the same transformation o as in the Pauli theory of the spin 
under the influence of the transformation o (spatial rotation) 
of Us. 

On integrating equations /6.10) over the spatial section 
X y = const. we obtain 


ad 
h= nae dg, Saute, my 
which we may consider as the law of inertia of energy. The 
H 
integral may be written /J)°&, = — -&,, where &,, &, &3 are 
the co-ordinates of the ‘“‘ centre of energy ’’; the equations are 
then 
Hf dé 
j= “32° —, ah 
cat 


We thus obtain the familiar mechanical law: Momentum 1s 
equal to mass times velocity, where the velocity is to be taken as 
that of the centre of energy and the mass as 1/c? times the energy 
content of the field. Nevertheless it is advisable nat to divide 
by H in defining the centre of energy, as the energy density 
— 19 is here no longer positive-definite, and we cannot be certain 
that the energy content // will turn out to be positive. 
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Our theory is a classical field theory, the quantum features 
entering only in the statistical interpretation. With this 
interpretation the field laws are concerned with a single electron. 
At the present stage of our development we can deal only with 
the additional quantities due to the electro-magnetic field by 
assuming a given external field affecting the motion of the 
particle, without the particle reacting on the field; we must 
then surrender our Maxwellian equations. The true laws 
governing the interaction between electrons and quanta will 
only be obtained, in analogy with II, § 13, on subjecting the 
system of field equations to the process of quantization, just 
as was done by Heisenberg for any system of classical mechanical 
differential equations. 

The fact that we are led back to our original assumptions 
concerning the operators representing positon and momentum 
is due to the particular expressions we have chosen for the 
action, from which the field equations were obtained ; indeed, 
it depends entirely on the part M. These original postulates 
of quantum theory are accordingly of less interest from the 
standpoint of general principles than we at first believed. But, 
on the other hand, this connection seems to indicate that M 
cannot be replaced in its role as representing the action due 
to matter. M is also responsible for the fact that the charge 
and probability densities agree, which is unconditionally re- 
quired as a guarantee of the atomistic structure of electric 
charge. These connections with the most fundamental physical 
observations thus require that the action be composed additively 
of M and further terms which are invariant not only under 
change of gauge (5.10) as is M, but also on replacing ~ by e4- f and 


0 
i. by fa — ve where A and pw are two independent arbitrary 


functions in space-time. J’ and the Maxwellian action F are 
in fact of this kind. Further relativistic invariant’ scalars 
satisfying these conditions are readily found—indeed it is not 
difficult to set up the most. general action possible with the 
quantities at our disposal. But we have yet to be convinced 
by physical observation that the three quantities M, M’, F 
here employed do not suffice. 


II. Electric and Magnetic Spin Perturbations. 


In order to be able to compare Dirac’s theory with the facts, 
we eliminate #, # in the same way as we did in the absence 
of the electro-magnetic field. We obtain the equation 


— VV = moy 
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with the new definition (5.9) of V and V’. The substitutions 
S, in two variables satisfied the equations 


SoS, = S189 = S13 S2S3 = — S38, = 18; ; 


and consequently those denoted by the same letters but operating 
on all four variables obey 


So51 =os 5156 = Si ; SoS. ae a0 = 1S}. 


V’V contains terms of the following four types : 

(1) (55. + fe) (5 + th), 
9 -(2+n)\(@+0) 
(3) Si (Se + the) (5 + th) — (Se +i) +ih)}, 
a) —s8if( 2 en)(2 +m) - (2 +m\(e+H)} 
) 


We collect together terms of types (1) and (2) to form the 


‘regular term’ in which the components of % are not coupled 
with each other : 


: 0 
~2,--) + Shr + sare. 
a CVa 


[The transition from lower to upper indices, i.e. from “ co- 
variant ’’ to ‘‘ contravariant’’ components, is performed in 
accordance with the equations /° = — fh, f? = f,(p = 1, 2, 3).| 
The irregular term consists of the electric part 


is (2 = ve) a aS Sits a) 


and the magnetic part 


si\(4 - vis) i AO ted) 


These become, on multiplying by the factor h and expressing 
the electric and magnetic field strengths © and § in the usual 
units, 


e Bie 
We have already (II, § 12) calculated the — term for a 
homogeneous magnetic field and found it to be - - (2). On adding 
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the regular and irregular terms we obtain, on neglecting the 
squares f2 of the potentials, 


= (9, Q+4+ 6’). 


This contains the fact, which was already derived in § 4 from 
a ee . 
spectroscopic data, that to the spin 5 ©’, twice as great a magnetic 


moment is to be ascribed as to the same amount of orbital 
moment of momentum; we have now obtained a convincing 
theoretical foundation for this procedure. The laws governing 
the interaction of a general inhomogeneous magnetic field with 
orbital and spin momenta emphasize still more emphatically 
the essential difference between { and ©’. The irregular electric 
term, calculated for the central-symmetric field originating in 
the nucleus, is the spin perturbation. 

The description of the electron given earlier, according to 
which it was a composite structure composed of two kine- 
matically independent parts—the electron translation, with an 
co-dimensional system-space, and the electron spin, with a 
2-dimensional system space—is, in view of the Dirac theory, 
no longer quite appropriate. But the classification of spectra 
given there is none the less valid here, for it depends only on 
the fact that to the group of rotations of physical space corre- 
sponds the representation ®, x © in the total system-space. 

From the field equations (6.1) as they are to be understood 
for the present, i.e. as the laws of motion of an electron in an 
external electro-magnetic field, dispersion phenomena can be 
(approximately) calculated ; they tell us how the motion of the 
electron in the normal or other quantum states 1s affected by 
the incident light wave. From the perturbed % we then deter- 
mine the scattered light with the aid of Maxwell’s equations ; 
to this class of phenomena belong in particular the Compton 
and Smekal-Raman effects.® Spontaneous emission can be 
handled similarly if we take the considerations of II, § 13, as 
justifying the following procedure: The polarization and 
intensity of light emitted by the quantum jump n— n’ of the 
atom is to be calculated by integrating Maxwell’s equations, 


where the expressions yp, wGy for charge and current density 


are to be understood as Pp) BMGPo), J being the 
characteristic function of the atom in the n“ quantum state. 
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IIT, Interchange of Past and Future. 


The action is so constructed that it is invariant under inter- 
change of right and left ; the corresponding substitution is 


Xq ~* Xo, Xp ~™ ——~ Xp y ) ‘ 

== 1,23 
JOP I. Sie es J ‘ : | 
pb, > Pi, wy-> by ; b> yy, Wy > ig’ 


Does a corresponding result hold for the interchange of past 
and future? The foundations of the theory lead to the hope 
that it will be able to take account of the essential difference 
between the two time directions, so obvious in Nature. But 
Dirac has remarked that M, M’ go over into — M, — M’ under 
the influence of the substitution 


(6.12) 


Ng PE De, ie ae — fy, (a Poe 0, 1, 2, 3) ) | 
trot torte hod, tee — J 


Hence when, in dealing with the motion of an electron in an 
external electro-magnetic field, we obtain a solution f% which 
contains the time in the factor e7'”, this substitution will lead 
us to anew s lution which contains the time in the factor e’”; or, 
more precisely, a solution of the problem obtained by changing 
finto -- f. But this can be done by retaining the same external 
field with potentials ¢ and replacing e by — e. We denote such 
a particle, whose mass is the same as that of the electron but 
whose charge is e instead of — e, as a ‘‘ positive electron ’’; it 
is not observed in Nature! It follows from what has been said 
above that the energy levels of such a particle are — hv, where 
hv are those of the negative electron. Disregarding this difter- 
ence in sign, the two particles behave the same. The electron 
will possess, in addition to its positive energy levels, negative ones 
as well, the latter arising from the positive energy levels of the 
positive electron on changing signs as above. Obviously some- 
thing is wrong here ; we should be able to get rid of these negative 
energy levels of the electron. But that seems impossible, for 
under the influence of the radiation field transitions should occur 
between the positive and negative terms. That we have twice 
as many terms as we should is obviously related to the fact 
that our quantity # has four instead of two components (satisfying 
first order differential equations). The solution of this dif- 
ficulty would seem to lie in the direction of interpreting our 
four differential equations as including the proton in addition 
to the electron. 


- (6,13) 
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The substitution (6.13) transforms the terms M, M’ of the 
action into — M, — M’', but leaves the Maxwellian term F 
unaltered. Our field equations as a whole, i.e. when we also 
take into account the reaction of the particle on the radiation 
field, are consequently not invariant under this substitution. 
However, there does exist a substitution which reverses the 
direction of time and which at the same time leaves all terms in 
the action invariant. We mentioned in III, §8 that the ex- 
pression (5.13) formed from a #% with two components takes on 
the sign 6,:5) = 1, 6, = — I(p = 1, 2, 3) on going over from 
wb, vw, to b,, — P,. Hence if w is a quantity which transforms in 
the same way as # then 


WS, w > 8° @ Sah 


on applying this to w = ro ox, we find that 
~ dy dip 
PS ar ae dy : yaa yb. 


Hence if we make in addition the substitution 


Kg > — Xp, Xp Xy (p = I, 2, 3) 
then 


and consequently M, formula (5.5), remains invariant. In the 
presence of an electro-magnetic field its components must 
change signs in accordance with 


to > fo, lg ey (p = 1, 2, 3). 


We have thus found that M, M’ and F all remain invariant 
under the substitution 


Xy > — Xo, Xp > Xp, ts 
to> fo fo — fo; i 7 I, % 3) (6.14) 
b> be, te > — fy; p, > By, py -> — Py 


This shows that the past and the future enter into our field 
theory in preciscly the same manner—in spite of the fact that 
the sign in the exponent of the time factor e~'” of a solution of 
the quantum problem is unchanged by the substitution (6.14). 
We must of course suspend judgment as to whether the laws 
governing interaction between photons and electrons allow us 
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to distinguish between these two directions in time until we 
have carried through the quantization (§ 12). 


§ 7. Electron in Spherically Symmetric Field 


We now proceed to the discussion of the behaviour of an 
electron in a spherically symmetric electrostatic field in Durac’s 
theory. 


I. Dirac’s Conservation Theorem. 


From the definitions follow immediately the commutation 
rules : 


Sof = TSS, = 7S, = 1; 2,3). 
We need further the results 
oe 6 a a 
and the commutation rules 
Ly Py — pPrly =9, (PX) = py Li + Pola + pals = 0, 
Ly p2— Poly = ips, Lyp: — pile, = — tps, 


for the components of linear and angular momenta) = (4, Po, ps) 
aid. 84g ges) 

In a spherically symmetric electrostatic field f, = fg = f,; = 0 
and f, == ® 1s a function only of the distance r from the centre. 
With the aid of the formule given above it is easily shown that 


Th, 
M,=1,+3S; 


commutes with ®, 7, (G’p) and consequently with each term in 
the expression 


“H = ® + (Sp) + mel (7.1) 


for the energy H. Indeed, this conservation law for the total 
] 
moment of momentum Jt = 2+ 5 8 was already known to 


us from general considerations. We further find that (©6’X) 
commutes with ® and 7, but that 


(S'L)(S'p) + (S'p)(S'X) = — 216'p) 


(S'p){(S'L) + YU + (SX) + YG'p) = 0. 


Hence (6’2) + 1 anti-commutes with {©’p) and therefore also 
with (Gp) ; its commutation properties with respect to the three 


Or 
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terms of (7.1) are therefore the same as those of 7. Hence on 
setting 


(SR) + 1 =k®T, (7.2) 


k is ascalar which commutes with the energy H (where by scalar 
we mean invariant under the group of rotations of space). 
Consequently we can decompose the system-space of the electron 
into irreducible sub-spaces *, associated with the rotation 
group, in such a way that the quantity &, which we call the 
auxiliary quantum number, as well as the energy H 
possesses a definite value in each of the sub-spaces. Now 


(SR)? = (LE + +) +4 {8,85 (Lely — Lele) + +3} 
= @—(SL,+4+)=2—-(6Q 


and consequently 


) 


7” _ ar ars 1 
(SQ + Y= L+ (GSH +1=(L+56') + p= M45 
Paka 7 

This agrees with 
, L\ ] 
m= jG +1) = (7 + 5) ~ | (7.3) 
when we put 
1 | 
peg (elie. (7.4) 


Accordingly, the auxiliary quantum number k 1s a non-vanishing 
integer. The conservation theorem (7.2) goes beyond (7.3) in 
giving us in addition the sign of k. For a given half-integral 7 


| 
the two values k= + (j -+- 5) are both possible; they must 


settee ae: : 
correspond to the two possibilities 7 = 7 + 5 of our previous 


notation. The single quantum number & replaces the two l, 7: 


II. The Differential Equation for the Determination of the 
Characteristic Values. 

Since the field is spherically symmetric, it suffices to carry 
through the calculation for the point x = 0, y=0, z==r. At 
this point 

2 0 2 oO 
1 dx’ cee 
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and the Dirac conservation law (7.2) becomes 
(7.5) 


together with the equations obtained from these by interchanging 
the two pairs o%,, % and wy, fw of components. The differential 
equation (6.1) for the characteristic vector %, which contains 
the time only in the factor e~’”", has as its four components the 
two 


[0 am) m) j 
Uys, =e i( = = i=.) U's -}- oui roa Mob, — 0 
./9 
Up, —1 a ix by — ee — Mo, = 0 


and two others of analogous structure ; we have here written 


Vv 


E=-, E-—-@®= U, 


’ 


The derivatives with respect to x and y which appear in (7.6) 
can be eliminated with the aid of (7.5); the resulting equations 


are 
feral Dr (mr tena] 


[ya get ome Bro] 


f= (0, 0, r), g Fe (0, 0, r). 


The remaining two equations are obtained by writing (2, wo) 
in place of (YW, w;). At an arbitrary point P = P(x, y, 2) the 
first and third components of w satisfy the equations (7. 7) in 
a rotated co-ordinate: system whose positive g-axis passes through 
P. Weshall find it convenient to introduce rf and rg as variables 
in place of f and g, as 


va a G taf 


If we wish to avoid the explicit appearance of 7 in the equations, 
we may write 


(7.7) 


where 


rf=v+iw, re=v— Ww 
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and obtain, finally, the fundamental equations 


Uy — mv —2w = 0 


dr 


(7.8) 
dv k 
Uw + Tp + Mo —- su 0. 


III. Spherical Harmonics with Spin. 


Let f(r), g(r) be a solution of equations (7.7); then in the 
rotated co-ordinate system 


b, =f. p, by = 8. p; tho == for, te = £.T 
where the factors p, +r are constants independent of r. On 
returning to the original co-ordinate system each of the pairs 


wy, we; 1, wo undergoes the transformation o associated with 
the rotation s. Consequently 


py ee tp1 - §71 by = §P1 a fry (7 9) 

Py = fp, + Ste | Po == Bp, + fre ) 
in which f and g depend only on 7, and the factors p, 7 only on 
direction, 1.e. on the spherical co-ordinates 6, ¢ introduced by 
setting 

x+iy=rsin 6e'? g=rcos @; 


the coefficients in (7.9) must further satisfy the conditions 


p,(1 — cos 8) — pg sin Oe'? = 0, (7.10) 
7,(1 + cos 0) + 7, sin Oe ** = 0. 


On substituting the expression for % in polar co-ordinates 
{II, (4.10)] into the Dirac conservation law, we are led, with 
the aid of (7.9) and (7.10), to the differential equations 


py a ee = 
sin er) —- 134 + k(1 + cos 6)7, = 0, 
OT, 


£1 qiOTG 

sin ew 1 x6 
We have thereby accomplished the transformation of the Dirac 
wave equation into polar co-ordinates. (7.9) corresponds to the 
substitution # = f(r) Y, of the scalar theory; in place of the 
single factor f depending only on the distance 7 we have here the 
pair f, g and in place of the surface harmonic Y, depending 
only on the direction we have the matrix 


(7.11) 
— k(1 — cos 6)p, = 0. 


P1 71 


P2 Te 


SPHERICALLY SYMMETRIC FIELD 231 


The equations (7.11), together with the conditions (7.10), define 
the ‘‘ surface harmonics with spin of order k"’; they are quite 
independent of the potential ®. The characteristic values E of 
the equations (7.7) or (7.8) are the energy levels associated with 
quantum number k, 

As in the theory of the ordinary spherical harmonics, we 
here again seek out those spherical harmonics with spin which 
contain the meridian angle only in the multiplicative factor e'™? : 


p, = em (sin O)-™+ P, +, = e'™ (sin O)-™- QO. 7.12) 


Substituting these expressions in (7.11) and taking 2 = cos @ as 
the independent variable, we find 


(1 — 2) = —mP + kQ, 
- (7.13) 


We denote the solutions P, Q of these equations which lead to 
non-singular functions p, t on the sphere more precisely by 
Pe, QM. It suffices to consider the case k > 0, for (— P, Q) 
is a solution of the equations obtained by changing & into — k: 


PO (2) == — PL), Os) = Oz). (7.14) 
Furthermore, 
i Ls Gaz 1) _— m) — Qtm-1) 


for the derivatives of P(™), Qt) ie the differential equations 
(7.13) with m — linplaceofm. Form= —k,P=1,0=—1 
is a solution which satisfies all continuity requirements on the 
sphere, since the multiplicative factor 


(sin 8)~™e'm™? == (x — 1y)7™ 
is finite for negative m. Consequently we find polynomial 
solutions of (7.13), the degrees of which are 0, 1, °° +, 2k — 1 


corresponding to the values m = — R, —k+,+:+, kR-1. 
The solution for m = k — lis 


P(g) = (1—2)FML + 28, Ole) = (1+ 2) (1 — 2) 
We thus finally obtain the following explicit expressions for the 
spherical harmonics with spin : 


dP 2 : m dP a rag 
Pa) = F{(L — 2) (1 + 2)", OMe) = Ft + 2) (1 —z)"} 
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where p=k—1-—m. They behave very much like the 
ordinary spherical harmonics. The following equations are 
also of importance : 


PO — 2) = (-— 1)? OL™z), O(— 2) = (— 1)”. PO™z) (7.16) 


§ 8. Selection Rules. Fine Structure 
I. Selection Rules. 


In a solution #% defined by (7.9), (7.12) ,, like p, and 7, 
contains ¢@ only in the factor e’™® and wp, like p, and To, only in 
the factor et(m* 1) ; aha for w,, #2. Hence 


M., = > 7 +g 3u= (m + + 5)b 
M,%, = 7d — 52 = (m +5). 


The z-component of the moment of momentum in the state 


(k, m) is accordingly m + : This change in the meaning of 
the quantum number m is to be carefully noted: m + * runs 


through the values 


] 3 
k—5 k—>@ i, MO ged edge Ge oa fe 
as it should. . 

In order to obtain the selection rules for the possible transi- 
tions (k, m) — (k’, m’) and to obtain the corresponding itensities 
we must calculate the matrix which represents the energy of 
interaction between the atom and radiation in terms of the 
co-ordinate system determined by the characteristic functions 
W™ defining the quantum states 2 of the atom. Proceeding 
as in II, § 13, we see from (5.15) that this matrix is 


3 
Soh, 
p=1 


The vector ec © here plays the same role as q there. The in- 
tensities are essentially determined by the elements ©(nn’). 
the three components of which are 


Sp(nn') = (POs, pordy, 


The selection rules are merely consequences of the fact that 
Gis avector. We first obtain the old result for m and 7 from 
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considerations involving the proper rotations of space. The 
rule for 7 asserts that the auxiliary quantum number k may go 
over into 


+(k—-1), +k, +(k4- 1). (8.1) 

To the reflection 1 corresponds the interchange T of the two 

pairs (W,, wo), (Yi, ys). In polar co-ordinates this reflection 

consists in the transition from (6, 6) to (7 — 8.7 + 4); 2 = cos @ 

is thereby transformed into — zg and the factor e’® takes on the 

sign (— 1)™. In accordance with (7.15) and the expressions 

for py, 71; pe, Te this results in an interchange of p,, 7, with 
possible change of sign, as represented by the substitution 

0 1 | 0 1 
ea es l k-1 

1 0 | (— 4) 1 0 


and the same for pg, 72. By (7.9) we therefore have for y with 
auxiliary quantum number k: 


Typ(— x, — y. -- 8) = (— I) ple, y, 2). 
The sub-space {R, thus has the signature 6 = (— 1)*7!;_ this 


result was cerived under the assumption k > 0. On replacing 
k by -- k and applying (7.14) we find in place of (7.16) : 


PON — 2) = (= LPO s), OO™(— 2) = (— 1)? PONLa). 


The signature corresponding to auxthary quantum number 


— k (k > 0) is accordingly (— 1)*. On setting 


(— pene 


U 


{ 


1 = — k when &k 1s negative G = -—k— _ (en 43 
(8.2) 


| | l 1\ | 
1 = k — 1 when k 1s positive (j=k—g=l+ 5) 


both possibilities are included under 6 == (— 1)’, or we could 
also write 6 = sen k+ (— 1)*"!. The only coefficients occurring 
in a proper vector are those corresponding to transitions in 
which the signature is reversed. Our selection rule (8.1) for 
kis thus narrowed down to 


k +> k—1,—k,k+1. (8.3) 


The following table gives the value of the auxiliary quantum 
number & associated with each possible combination of / and 7: 


Ge TIO LF Oe Bet 


jal—3 —1 —2 —3 ~—4--- 
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Il. Transition to the limit c > oo. 


In order to return from relativistic to ordinary mechanics 
we must pass to the limit c-— oo. Before applying this to 


equations (7.8) we must replace U, v by my + . cv; we then 


have, on neglecting : in comparison with aa 
dseéokR 
Uo = (G+ ym 
d ek 
2mw = — h( = —- =v 
on eliminating w we obtain 
hd k\fd ek 
aaa ml a | ard 
or 
had k(k ~ 1) _ 
gala pe JU + Uv = 0 


On introducing / by (8.2) we have in both cases k(k — 1)=/(/+-1). 
Hence in the limit terms with the same J, and therefore those 
with auxiliary quantum numbers k and — k — 1, coincide with 
that one associated with azimuthal quantum number / in the 
scalar theory of Chapter II]. The doublet found in alkali spectra 
—and in general the multiplet structure of spectral lines—as 
accordingly explained as a relativisiic phenomenon. 


Hie Tine & <3 
In a Coulomb field with nuclear charge Ze we have 


Za 
— °= Gr 
employing Heaviside units, which are better adapted to a field 
theory. In the following calculations we shall denote the 


multiple she of the fine-structure constant va simply by « itself, 


and we shall set myc == v9. In order to integrate equations 
(7.8) we first perform the substitution 


Pees k. wel G, 


where f is a positive constant. Our equations are then 


Be tra —H +o 


SELECTION RULES. FINE STRUCTURE 235 


Our method will lead to a solution if we choose the constant B 
in such a way that the determinant of the linear combinations 
of F and G on the right vanishes : 


v\? ¥o\? apne, 
(*) (*2) f= 0. ces V9? — v2. (8.5) 
We new seek a power series solution 
f= 2a Gap). 
where the exponent p begins with an initial value py and runs 


through the values po, fo + 1, wy + 2, °°. On substituting 
these in (8.4) we obtain the recursion formule 


(H+ Ry — ay == (& — ay + BOu a, 
Pheu hie: = Bay — (+ ?)ba 


C 


(8.6) 


The initial exponent p = py is determined by the fact that the 
determinant of the coefficients of a,, b, on the left must vanish 
for this value of the index: 


pee ee ee a ee 
Because of the manner in which B was beeing in (8.5) there 
exists a lincar relation, with coefficients a ae ms B between the 


right-hand sides of (8.6) which is satisfied identically in a,_,, By-. 
Hence for all p 


(FF) [a + by — % ay] + Blaby + (w — hay] = 


C C 


[Vv Wy = _ (¥ | Yo _ 
by} (” I Vy + k) -F xB | r a,| Blu k) te 2) a |= 0. 
(8.7) 
The power series will break off with the term with exponent p 
if on replacing a,.., bu, by a,, 6, the right-hand side of (8.6) 1s 
made to vanish. The condition for this 1s that 


Bb, + (= — “ay = 0; (8.8) 


C 


it will be satisfied in virtue of (8.7) if the determinant of the 
coefficients in these two equations vanishes : 


(CNC )u te tet ]-alpn 8 -( eMe]=< 
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or by (8.5) 


v vot 
ope: 
a e Bp CB 


Since the exponent yx with which the series break off must be 
of the form py + ”, where » is a positive integer, we obtain the 
fine structure formula 


1 panes 
a ae / 2 __ 2 
Vea Vg Be a TVR Ot) 88) 


The solution # of our differential equations, for the char- 
acteristic values vy = cE defined by (8.9), is of the form 


e~8r . ye» (polynomial of degree 2 in 7) 


and satisfies the condition that the spatial integral of ||? con- 
verge in the neighbourhood of the singular points r = 0, oo. 
These FE consequently constitute the discrete term spectrum of an 
ion with nuclear charge Ze and having but one electron outside 
the nucleus. If we neglect the small constant « in comparison 
with k, E depends only on n + |k|. This fine structure formula 
further tells us that the two terms with auxiliary quantum 
numbers k and — k, or the two terms with the same 7 and for 


which 1 =7 + * exactly coincide. That this is in fact found 


to be the case has already been mentioned in § 4. Equation 
(8.9) has had a remarkable history. It was first derived on the 
basis of the older quantum theory by Sommerfeld and, at about 
the same time, verified by the experiments of Paschen ; it was 
perhaps the greatest triumph of that theory, next to Bohr’s 
explanation of the Balmer series and his calculation of the 
Rydberg number from universal atomic constants. The new 
quantum theory at first destroyed this beautiful agreement, 
as in its scalar form it led to (8.9) with the half-integral quantum 
number j in place of the integral |k|. Sommerfeld’s original 
formula was only completely re-established with the advent 
of the Dirac theory here discussed. The quantum number Rk, 
which was used in the older quantum mechanics in place of [| 
and which may assume the value 0, has also re-appeared and 
is now supplied with a sign. But on the other hand, the number 
of components in the fine structure is now greater than in 
Sommerfeld’s theory, as in addition to the transitions kR>k—1, 
k + 1 we may now also have k-> — k; this addition 1s also in 
agreement with experiment. 
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Our conclusion that (8.8) was to be satisfied in virtue of 
equation (8.7) for the unknowns a,, b,, assuming that the deter- 
minant of the two equations vanished, fails when both coefficients 
of equation (8.6) are zero: 

v + vp a  pw—k 


cB wet+k  « 
It follows from this that then p = Vk? — @?, or n = 0, and that 
w+k<O0, or R<O0. There actually exist no terms 1 = 0, 
k=-—1,—2,-: +. For the coefficients a,, 0, of the beginning 
term in the corresponding solution, which is at the same time 
the end term, would by (8.6), (8.8) necessarily satisfy the equations 


V-— Vo 
-a, = 0 
c 


(u + k)b, — aa, = 0, ab, + (uw — k)a, = 0, Bb, + 


Or 


“ od _p—-k _v—vw 
( A, etre a cp’ 
and this is impossible because of the condition |p| < v.18 

In accordance with the foregoing we may describe the normal 
state of the hydrogen atom; n=0, k=1 (l=0), as follows. 
We take the quantum number m, which may assume either of 
the values 0, — 1, to be 0. Let a = 0-532 A. be the radius of 
the first Bohr orbit and « = 7-29- 107% the fine-structure con- 
stant. ,, %,; ,', %.’ are obtained by multiplying the radial 
function 

A(r) — eta. yJ1—ot-1 


with the factors 


(1+ V1— 2) +iacos 6, iasin Ge'*| py, pp 
(1+ V1 — a?) —iacos 6, —iasin Be’? | py, Wo. 


We find from these expressions that the probability density ab 
is distributed spherical-symmetrically in accordance with the 


law 
p = [A(r)]?. 


The normalization is here not chosen in such a way that the 
integral of p over all space is unity ; it is actually 


1+ 2/1 — a3 eo ae 
4n(3) TL + 2V1— a). 


We have already seen that in a certain sense the probability 
density multiplied by — e represents the distribution of charge 
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in the atom. Considering the probability current as deter- 
mining the convection of this continuous charge distribution p, 
we find that it represents a circulation about the z-axis with 
velocity «ac sin @ (ac is the velocity of the electron in the first 
Bohr orbit on the older theory). On giving the axis of rotation 
all possible directions % runs through the 2-parameter family 
of characteristic solutions for which n=0, k=1; we may 
take as a basis for this family of solutions the above (m = 0) 
and that for which m= — 1, representing a circulation in 
the opposite direction. 


C. THe PERMUTATION GROUP 


§ 9. Resonance between Equivalent Individuals 


The Hermitian forms Q, which represent in system-space all 
possible physical quantities of a given system, constitute a 
totality 2 within which addition and multiplication is defined. 
If & were reducible we could choose our co-ordinate system in 
system-space in such a way that all Q would be simultaneously 
completely reduced; these individual parts into which the whole 
would be divisible would then each constitute solutions of the 
quantum problem which were merely accidentally joined to- 
gether to form the given solution. In accordance with the 
fundamental Aristotelian postulate of ‘“ nihil frustra’’ Nature 
could hardly be expected to indulge in such a superfluous luxury. 
Hence we propose the thesis that & 1s an irreducible system. On 
introducing as fundamental quantities the canonical variables 
as in II,-§ 11, this assumption contains the requirement that it be 
impossible to choose co-ordinates in system-space in such a way 
that the 2f matrices q,, °° *, 443 Pi °° *) pr are simultaneously 
completely reduced. This postulate 1s to be added to the Heisenberg 
commutation rules as an essential supplement. 

In accordance with Burnside’s theorem [III, § 10], which 
we carry over without scruple from spaces with a finite number 
of dimensions to those with infinitely many, the irreducibility 
postulate allows us to assert that there can exist no linear 
homogeneous relation tr(AQ) = 0 between the components of 
Q which is satisfied for all Q. Since in the domain of the Q’s 
not only is multiplication possible—as presupposed in Burnside’s 
theorem—but also addition, we arrive at the conclusion that all 
Hermitian matrices in system-space are contained in 2. It is 
perhaps desirable to express our requirement directly in the 
form: any Hermitian form represents a physical quantity of 
the system. In accordance with II, § 7 there is associated with 
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each statistical ensemble a positive definite Hermitian form A 
in such a way that tr(AQ) is the expectation of the quantity 
represented by Q. Burnside’s theorem asserts that the equation 
tr (AQ) = tr (A’Q) 
can be satisfied for all Q only if A = A’, or it 1s impossible to 
distinguish between the two statistical aggregates represented by 
the positive definite Hermitian forms only1f A = A’. Inparticular 
it follows from this that the states represented by two rays in 
system space are physically different if the two rays are distinct ; 
this was to be expected, or even required, from the outset. 
These consequences show the naturalness and cogency of the 
irreducibility postulate, from which it can conversely be deduced. 
The states of physical entities I which are fully equivalent, as, 
for example, the electrons in an atom, are to be represented by 
vectors £ = (%,) or rays in the same system-space RR. If two 
such individuals unite to form a single physical system /? the 
vectors of the corresponding system-space ® X R = R? are, 
in accordance with the general rule of X-multiplication, the 
tensors (%,,) of order two. But, by III, § 5, R* is reducible into 
two independent sub-spaces {§?} and [?], the space of anti- 
symmetric and the space of symmetric tensors of 2nd order. 
Physical quantities Q of J/* have only an objective physical 
significance if they depend symmetrically on the two individuals. 
This requirement is expressed in terms of the elements of the 
Hermitian form 


O = Lain vn Xin Kiev 
by the symmetry condition 


Whi. ki’ == Lik, ik’ (9.1) 

On reducing (x,,) into its anti-symmetric and its symmetric 
parts, 

Kix == x{tk} + x(k) (9.2) 

Q is reduced, in virtue of (9.1), into two Hermitian forms in 

x{tk} and x(1k) respectively. For on substituting (9.2) into Q 

we obtain four terms: those in which {R*}, [R?] intersect them- 

selves, and the two in which {Rt?} intersects [R?] or conversely. 


These last two then vanish, for if we interchange the dummy 
indices 2 with k, 2’ with &’ in 


(Q) = 2G in, vx Xie} x(2'k’) 
and then replace 


Vki, ki?» x{ki}, x(k't’) by Qik, ’k’s a{ik}, x(t'k’) 
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we find [Q] = — [Q], or [Q] = 0. The totality of Hermitian 
forms Q which represent the quantities of J? depending sym- 
metrically on the two individuals is therefore not irreducible ; it 
can be reduced in accordance with the decomposition 


RE = {RF + [7] (9.3) 
of the space R®. 

In particular, every possible interaction between the two 
individuals depends symmetrically on them, even when other 
physical elements, such as a radiation field, are also involved. 
Hence if J? is at any time in a state contained in one of the 
sub-spaces {R?} or [M2] it is for all time impossible to get it out 
of this sub-space by any influence whatsoever. Again, we expect 
Nature to make use of but one of these sub-spaces, but the 
irreducibility postulate offers us no clue as to which one she 
has decided on. 

Take as co-ordinates in the system space of the individual 
I the principal axes e; of the energy associated with the char- 
acteristic numbers &;. Disregarding the interaction between 
the two individuals for the moment, the system /* has as energy 
levels E, + E, with characteristic vectors e; X €, = €;,; each 
characteristic number of the type &, + E, appears twice, and 
the corresponding characteristic space is spanned by the vectors 
Qi. and e,,. On introducing the interaction as a small per- 
turbation the two states €,, and e€,, are in resonance with each 
other. Denoting the components of the total Hamiltonian 
function by A(zk, 7’k’), the transformation of the sub-matrix 


ris @ H(1 2, 20 | 


H(21, 12) A(21, 21 
to principal axes, as required by perturbation theory, can in 
the present case be performed in a manner which 1s universally 
valid; we need only to replace the fundamental vectors e49, €2, by 


sles — a), a(t + 21). (9.4) 


Denoting H(1 2, 12) = H(21, 21) by fw and the numbers 
H(1 2, 21) = H(21, 1 2), which must be real in virtue of the 
condition H(1 2, 2 1) = H(21, 1 2) of Hermitian symmetry, by 
ha, the resonance equations become 


ae (v X19 + % %o1) = 0, 


Sade + (a %2 + ¥%q1) = 0 
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‘om which it follows that 


ae = — u(y — a)(X%y9 — Xo), 
ei Sal = — 1(v-+ a)(xy> + %q)). 


aking as initial conditions x,, = 1, x,, = 0 for t= 0, we find 


Myq — X—qy = OTOH, Xe + gy = eT a (9.5) 
| Vis |? = cos? al, | Ns k =: sin? at. 


Ve see from this how the two states @,,, e€,, alternate back and 
oe” 
wrth with the beat period = whereas the components (9.5) 


long the axes (9.4) have always the same constant absolute 
lagnitudes. 

The only characteristic numbers associated with the system 
race {ft} are those of the type E, + E,, each of which appears 
xactly once, but the sub-space [%R?] has simple characteristic 
umbers of the type 2/, 1n addition to these. Hence if Nature 
ecides in favour of {R?} both individuals can never be sim- 
Itaneously in the same quantum state with energy &,—assum- 
ig this energy level for the individual system is non-degenerate. 
hat FE, + E, occurs only once in {R?} and only once in [R?] 
1eans: the possibility that one of the identical twins Mike 
nd Ike is in the quantum state &, and the other in the quantum 
‘ate E, does not include two differentiable cases which are 
ermuted on permuting Mike and Ike; it 1s impossible for 
ther of these individuals to retain his identity so that one of 
rem will always be able to say ‘‘ I’m Mike” and the other 
I'm Ike.”’ Even in principle one cannot demand an alibi 
f an electron! In this way the Leibnizian principle of coin- 
dentia indiscernibilium holds in quantum mechanics,}? 

On passing from 2 to f equivalent individuals / it is not so 
asy to reduce the representation (c)/ of the complete linear or of 
1e unitary group in system-space ® into its irreducible con- 
ituents; we shall go into this matter in the last chapter. 
evertheless we know from III, § 5, that the anti-symmetric 
ad the symmetric tensors of order f with components 


x{kykg > > > Ry}, x(Rykg + + + Ry), 


‘spectively, each yield such an irreducible representation. 

physical quantity Q of the total system J/ which depends 
7mmetrically on all f individuals will be represented by an 
lermitian operator Q, the coefficients g(k,k,-°-> ky; Riko: ++ Ry) 
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4 
of which are unchanged on subjecting k,k,°--k, and kik, ++ kg 
simultaneously to the same permutation. It is evident that such 
an operator always sends an anti-symmetric tensor x{k,k, +: + ky} 
into an anti-symmetric tensor x’: 


MRR ee Ry eda s+ ys Rykty + + + Rya{kiko: + + Ry}. 


Hence the sub-space {/} of anti-symmetric tensors is reduced 
out of the system-space R/ of Jf, determined in accordance with 
the general rule of xX-multiplication, in such a way that if // 
is ever in the system space {/} it remains there forever, regard- 
less of what influences may act upon it. The sub-space [9] 
of all symmetric tensors x(k) of order / can similarly be separated 
out of Rf. The energy level &, + &,+ °° ++ E;, which is 
f!-fold degenerate in R/, appears in {9} as a simple level. Only 
characteristic numbers of this type appear in {R}, but the 
characteristic numbers of [R/] are all numbers which can be 
obtained by summation of f distinct or non-distinct energies E. 

If the system space is 2-dimensional, {8} is only possible 
iffsn. If EF is an n-fold energy level of the individual I then 
the quantum states with energy /£ constitute an 2-dimensional 
sub-space R(E). If it should happen that only {#/} is realized 
in Nature, then in view of the foregoing it would be impossible 
to have more than n individuals of the system I! in the quantum 
state Is. 

The’ reduction of R to {R/} or [R/] involves relationships 
which frustrate any attempt at description in terms of our 
old intuitive pictures with their orbits and billiard-ball electrons. 
But the difficulty enters already with the general composition 
rule, according to which the manifold of possible pure states 
of a system composed of two parts 1s much greater than the 
manifold of combinations in which each of the partial systems 
is itself in a pure state. 


§ 10. The Pauli Exclusion Principle and the Structure 
of the Periodic Table 


One of the most fundamental facts of Nature, the ordering of 
the chemical elements in the periodic table, can be understood 
only with the help of these considerations. We go from one 
atom to the following, which we denote by A, in two steps: 
the first is preparatory and consists in increasing the charge 
on the nucleus by 1, and the second and final step consists in 
adding an electron to the ion At so obtained. To obtain the 
normal state of A this additional electron must be bound as 
tightly as possible, i.e. the energy of the total system A must be 
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a minimum. If we disregard the mutual perturbations of the 
electrons for a moment, although they may be very considerable, 
we might expect to find every electron in an unexcited atom in 
the lowest energy level, 1.e. with principal quantum number n = 1. 
But instead we find the following: The 1 electron of H and the 
2 electrons of He are in the Is orbit, 1.c. they are in the quantum 
state n= 1,1=0. But the next 2 electrons, which are added 
in going over to Ji, Be, are in a 2s orbit, and the additional 6, the 
addition of each of which gives rise to one of the elements from 
B to Ne, enter the 2p orbit. Then follow Na, Mg, each with a 
new electron in the 3s state, the elements from Al to A, the 
additional electrons entering the 3p orbit, etc. These facts 
are readily scen on writing the wave number of the lowest 
S term in the form — R/n?,; in Hl, He, Li the ‘ effective 
quantum number” 2, has the values 1°00. 0°74, 1°59. That 
n, sinks on going from H to He is understandable in view of 
the ‘“ screening ”’ effect of the original electron on the new one. 
We should expect that if the next clectron also went into the 
orbit 2 = 1 the corresponding value of 2, would be something 
like 0°59, but we find instead a number which is greater than 
this by unity. The same occurs on going from Be to B or from 
Mg to Al; the normal states of these atoms are formed by the 
valence electrons entering 2p or 3p orbits because the 2s or 3s 
orbits are already ‘‘ occupied,’ and if the valence electron is 
raised to an s state by excitation, it can only be raised to one 
for which n 23 or n 2 4.* Obviously the essential features 
of the regularities expressed in the periodic table depend on this 
mysterious numerus clausus for the various states with principal 
quantum numbers nx = 1, 2, °° + and on the fact that in conse- 
quence of this the electrons in the atom are added on in definite 
layers or “shells.” Stated more precisely, in an ms orbit 
(n = 1, 2,--+) there is room for but 2 electrons, in an np orbit 
(1 = 2, 3,---) for but 6; in general the situation is described 
by Stoner’s rule: there can be at most 2(21+ 1) electrons in a 
state with quantum numbers n, l. 

On taking into account the duplicity caused by the spin we 
see that this number is exactly the dimensionality of the sub- 
space R(/) in the system-space of a single electron. Neglecting 
the spin perturbation, which is indeed much smaller than the 


* The physical significance of the “‘ true principal quantum number ”’ 
n is contained in these considerations : we think of the term in the Hamiltonian 
function which represents the energy of interaction between the various 
electrons as multiplied by a numerical factor A and let A decrease steadily 
from £ too; this virtual adiabatic process sends each electron into a definite 
hydrogenic orbit with a principal quantum number 7, the ‘ true quantum 
number "’ of the electron. 
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mutual perturbations of the electrons, the energy level assocti- 
ated with this sub-space is 2(21-+ 1)-fold degenerate. This 
degeneracy can be removed by the introduction of the spin 
perturbation and a weak magnetic field; the energy level is 
then broken up into 2(2/ + 1) simple components distinguished 
by the quantum numbers 


goals m=jJ,j—l,°--, ay 


Stoner’s rule led Pauli to postulate the exclusion of equivalent 
orbits: it 1s impossible for two electrons in an atom to be simutl- 
taneously in the same quantum state (n, l, 7, m). This shows 
that #/ is obviously not the system space of the physical system 
If in which f electrons revolve about a fixed nucleus, but that 
the reduction to {9t/} takes place: Nature has decided in favour 
of the reduction to the space of anti-symmetric tensors, at least in 
the case of electrons. In view of the considerations advanced in 
the previous paragraph this principle leads conversely to Stoner’s 
rule.}® 

If the formation of one atom from the preceding one were 
an entirely regular process the occupation of the various states 
would take place in accordance with the following table, the 
lower row of which indicates the number of electrons captured, 
on going from atom to atom, by the orbit immediately above : 


Is; 2s, 2p ; 35, op, 3d ; 4s, Ap, 4d, Af. ye 


2; 2+6; 24+ 64 10; a ey aes | 


This would indeed be the case if we could increase the charge on 
the nuclei by some large fixed amount, for the mutual perturba- 
tions of the electrons could thus be made arbitrarily small in 
comparison with the Coulomb attraction of the nucleus. But 
even a rough calculation shows that these perturbations are 
actually too considerable not to lead to displacements in the 
above table, i.e. to changes in the order {i which the various 
Shells are filled. For example, after the 3p shell is filled, which 
is accomplished with A, the next 2 electrons go into 4s states 
to form K, Ca, and only then do we find electrons entering the 
3d orbits to form Sc, Ti, ++ -. For details consult the books 
by Hund, Pauling and Goudsmit or Ruark and Urey mentioned 
in the Introduction. 

It is not the purpose of this book to report on the extensive 
empirical data of spectroscopy, nor to show how the two main 
principles required to lead beyond the general scheme of quantum 
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mechanics to the interpretation of spectra were wrested from 
this material; I here refer to the introduction of the inner 
quantum number 7 in addition to the azimuthal /, or the spinning 
electron, on the one hand, and to the reduction of Rf to {RS} 
by means of the Pauli exclusion principle on the other. 
Millikan begins his report to the American Philosophical Society 
on ‘‘ Recent Developments in Spectroscopy ” [Proc. Am. Phil. 
Soc. 66, p. 211 (1927)], with the words: ‘* Never in the history 
of science has a subject sprung so suddenly from a state of com- 
plete obscurity and unintelligibility to a condition of full illu- 
mination and predictability as has the field of spectroscopy 
since the year 1913.’’ The theory of groups offers the ap- 
propriate mathematical tool for the description of the order 
thus won. 

The lines of the optical spectrum are caused by quantum 
jumps of the electrons which are most loosely bound. In the 
alkalies Li, Na, K, - + + the one involved is accordingly in the 
state 2s, 35, 4s, - + +. We also understand why their cores 
Lit, Nat, K', +++ are spherically symmetric, and therefore 
why their spectra may be approximately calculated in terms 
of the motion of an electron in a spherically symmetric field ; 
the real reason behind this is the following. That an electron 
has the quantum numbers n, / means that its state 1s in a 
sub-space §, of A= 2(2/+ 1) dimensions. The sub-space 
{R, x R, x - + + x Rp with A factors, as obtained by the anti- 
symmetric reduction of Ry, is l-dimensional and the rotation 
group induces in it the 1-dimensional identical representation ; 
ic. a shell consisting of X electrons 1n the state n, 1 acts spherical- 
symmetrically ; its presence does not increase the mantfold of 
terms. ence the “ closedness’’ of those elements with which 
a shell is completed ; the rare gases, which precede the alkalies, 
are elements of this kind. But we should also expect Cu, Ag, Au 
to have alkali-like spectra, as they contain but a single electron 
in the s state, while all the others are bound more tightly in 
a ‘‘ closed” configuration with an external field which 1s spheri- 
cally symmetric. The valence of the clements must obviously 
find its explanation in these terms; indeed, 1t gave the clues 
which originally led to the discovery of the periodic table. 
But only in recent times have we been able to call on the assist- 
ance of spectra, interpreted and arranged with the aid of atomic 
theory by Bohr and others, and they have verified the principal 
features of the table, while modifying, supplementing and 
Improving its details. 

The consequences of the Pauli principle for the term analysis 
of atomic spectra will be discussed in detail in Chapter V, 
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particularly in $15. We here mention briefly the results for 
the case of 2-electron spectra f = 2. 

Just as the alkalies may be treated as if they were but 
l-electron atoms, in dealing with the alkaline earth metals we 
need only take into account the two most loosely bound electrons 
which occupy an s orbit outside a spherically symmetric closed 
shell. As before, we obtain one singlet and one triplet term 


(nl, n’l’; L) 
whose total azimuthal quantum number L assumes the values 
Lh=l+lU,l4+U—1,:--, |f-T| 


assuming that the two quantum states (7), (#’l’) of the individual 
electrons are distinct. The only difference is that now such 
a term appears only once, whereas before it appeared twice, 
corresponding to a permutation of the electrons. The situation 
is, however, more complicated if (7) = (n'l’). The only singlet 
terms ; 


(vl; nl; 


’ 


L) 


which actually occur are those with even L = 0, 2,- ++, 21 and the 
only triplet terms are those unith odd L -= 1, 3,++-+, 21—1. This 
rule is thoroughly in accord with the empirical data. 

The best-known lines of the spectra are those arising from 
transitions in which only one electron is not in the normal state 
and is jumping between higher energy levels. Flence if one 
of the two electrons (not saying which!) 1s in the normal state 
nN =, | = 0 (mn) = 1, 2, 3, 4, -- + for He, Be, Mg, Ca, «: -) 
we have L = 1 and the two quantum numbers (n, J) suffice to 
determine the singlets or triplets. The lowest S term (L = 0) 
of the singlet system has the principal quantum number 2 = 1p, 
but there 1s no such term in the triplet system ; it begins with 
n=n +1. We find that the lowest S term in such a triplet 
system (which is, as we know, simple), e.g. in the spectrum of 
Mg, actually does lie in the neighbourhood of the second lowest 
S term of the singlet system instead of the lowest. 


§ 114. The Problem of Several Bodies and the Quantiza- 
tion of the Wave Equation 


In this paragraph we depart from our usual terminology 
and denote the number of individuals by » instead of f. We 
first consider more fully the reduction of SR" to [R"], for we shall 
find that although it does not apply to electrons, it does to 
photons. Let H = ||H,,|| be the Hamiltonian function of an 
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individual. The variables d(n,, m2, °° +) of the unitary space 
[Rt] behave like the monomials 


xt ane a (n, + noferer= 12), (11.1) 


Vy!ng! es? 


of degree 1 which are formed from the components x, of an 
arbitrary vector in &; we denote this monomial (11.1), without 
the denominator, by 4(n,, mg, °° *). We shall have occasion 
to use the differentiation formula 


A(x ht se 6) = (yxy) eft s+ doy) fe (tg xp he's dx) to 


In the absence of interaction between the individuals we obtain 
from 
1 dx, 


es Hag Xp = 90 (11.2) 
1 al B 


the equation 


_ : b(1y, No) = 2p (My — 1, 2, °° +, wg + 1,° °°) 
+ tte DHop hlity, tg—-1, °° *, tga ls: s) 
In the sum on the right d(m, — 1, 2g, °° +, te +1, °° +) Is to 


be interpreted as 4(7,, me, °° *) for B= 1; similarly for the 
term with B = 2, etc. We can also write this equation 


lL. 
7 > P(ny, No, 0 t) == Ly gy P(My, Me, + * *) 
+ Dry Hage (0+, tex ~ Let, tpt 1 :). 
a ¢pB 


On introducing the binomial coefficients in accordance with 
(11.1) we obtain as the equations of motion 


Ldiyb(n,, to, ° °° 1 
ty (ry rife a a Dis Het ig ee) 


+S Va (tg+ Diag b(t, ty, ty tp td, s+). (11.3) 
ve 


These equations are of the form 


Pe te Wis. = Sip np (11.4) 
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where the matrices y,, are defined by 


ee ae ae a ee Ng if ny = ny, ee 3 

Noor M1, to, > My, No, ) =F otherwise (1 1.5) 
and for a + 8B 

roe WV n,(g + 1 a 

Nap (M1, 19, ae er nN, No, m8 ‘) = {o al B ) (11.5’) 


where the first alternative holds when all 2’ =n with the ex- 
ception of ny=n,—1, ng=ng+1 and the second in 
all other cases. H is, as it should be, an Hermitian matrix. 
If H is in diagonal form the fundamental vectors forming our 
co-ordinate system are the quantum states of the various in- 
dividuals ; [ys(n,, 122, + +)|? 1s then the probability that there 
are simultaneously m, individuals in the first quantum state, 
n, in the second, etc. On reduction from §* to [§*] it becomes 
impossible to identify the individuals as Mike, Ike, - + - and we 
therefore may not ask for the probability that Mike is in the 
at state, Ike is in the BY, ---. If we have in addition to H a 
perturbation eW affecting the individuals (and symmetric with 
respect to these individuals), then equation (11.3) governs the 
change of the probabilities |y(7, m7, + > -)|® in time. 

The Hamiltonian function H reminds us of the one which we 
obtained in Chapter II, § 13 by quantizing Maxwell’s equations ; 
there the individuals were photons. Maxwell's equations are 
to be considered as the quantum-theorctical wave equations of 
an individual photon. If we replace the photon by an individual 
whose state (x,) varies in accordance with cquation (11.2) we 
are led to a new way of treating the problem of several bodies, 
which we call the ‘‘ method of second quantization ”’ in contrast 
to the ‘‘ method of composition” or ‘* X-multiplication ” de- 
veloped in Chapter II, § 10. In this we consider (11.2) as the 
classical equations of motion of a physical system whose canonical 
variables are the real and imaginary parts q,, pz of x,, and as 
such subject them to the process of quantization.1* We here 
tie on to the development given in Chapter II, § 11. Introduce 
the complex quantities 


l 7 ee eee 
Xe = Vat =r 1px), Le = Jat 1D.) 


into the Hamiltonian function H as independent variables in 
place of qu, Pa; the Hamiltonian equations are then 
AX» OH dz,  .dH 


dt = 3G,’ ad = ’ 3x, (11.6) 
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In order that (11.2) may be considered as the classical equations 
of motion of a system with infinitely many degrees of freedom, 
in accordance with our programme, they must be of the form 
(11.6). But this is in fact the case; the Hamiltonian function 
is then 


H = YHyph,%p. 
a,B 


In quantizing x,, @, are to be replaced by Hermitian conjugate 
matrices x,, X, which satisfy the following commutation rules: 


X,X_— XgX,==0, XaXg— XpXq = 0, | 


ad 1(n= (11.7) 
taky— Fake br = [900 a) | 
The Hamiltonian function H then becomes the matrix 
H = Op Xa Xp | (11.8) 
a, B 


if H is in diagonal form then 
Nee XX 


We are here dealing with an infinite set of oscillators, the in- 
dividual members of which are distinguished by the index « ; 
the energy of the a" is given in terms of the complex co-ordinates 
Mig Ug DY dig Le Kn 

The quantum theory of a single oscillator as developed in 
II, § 3 gives us as the irreducible solution of 


XX — XX = 1, 


where xX, X are two Hermitian conjugate matrices normalized 
in such a way that the energy xx is in diagonal form, the matrices 


x(n, n+ 1) == Vn +1, X(n,n— 1) = Vn; Xx(n, 2) =n, 


all other components vanishing ; the quantum number n assumes 
the values 0, 1, 2, +++. From this we obtain the solution of 
(11.7) by composition : 


| Vin 7] if all xn’ =n 


, ’ 
X(N, Wy 2S Re 114, No, * ° =) — except n:, == he -4. 1, 
lo otherwise ; 


Ps fall n’ =n 


except n, = ma — |, 
0 otherwise. 


X(N, No, te ny, blo, ae ') - 


The products X,X, are of course in diagonal form; x, X, is the 
matrix 7ag introduced above, and (11.8) coincides with (11.4) : 
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the method of second quantization leads to the same result as the 
method of composition supplemented by the ‘‘ symmetric reduction "’ 
of R" to (R"].. But now the number 


Nye Nepean 


of individuals is not prescribed; however H is reduced into 
sub-matrices in accordance with the various values of n, for 
all components H(nyn, ++ °; myn,+ + +) for which n, + ny + 
es est nytn,+-+:+-+ vanish. The total number of photons 
is not conserved, and to this extent Maxwell's equations do not 
fit completely into the quantum-theoretical picture—unless we 
wish to consider “ non-existence’? as a particular quantum 
state of the photon. 

The method of composition remains applicable in the presence 
of interaction between the individuals, provided it is an in- 
stantaneous action at a distance determined by the simultaneous 
values of the canonical variables of the various individuals. 
But it breaks down when, as in the theory of relativity, account 
is taken of the finite velocity of propagation, which led to the 
introduction of continuous fields in the classical theories. The 
difficulty arises from the fact that the wave function w must 
contain the one time ¢ as argument in addition to the spatial 
co-ordinates of cach particle, whereas the theory of relativity 
requires that the proper time of each particle appear as argu- 
ment in w as well as the spatial co-ordinates. The method of 
second quantization shows its superiority in dealing with such 
problems. 

As we have seen, the method of second quantization in 
accordance with Heisenberg’s commutation rules is equivalent 
to a reduction of the system space RK" to [R"]. Since we have 
seen in II, § 13 that this leads to the correct laws of radiation 
phenomena, we must conclude that the behaviour of photons 
corresponds to this reduction. But in the case of electrons the 
reduction is to the space {R"}, and we must now investigate 
to what kind of quantization this corresponds.?° The vectors 


of the unitary space {%"} are the anti-symmcetric tensors with 
components 


x{ Oy, Mo, * °°, On} = Xs Keo) Me eee (11.9) 


in the space , where the one row in the determinant stands for 
the x rows formed in the same manner from 7 vectors y = x"), 
rp), - + + x of ®. We can obtain the totality of linearly 
independent components by restricting the indices by the 
condition 


Oey Oe Oe, (11.10) 
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We now denote (11.9) by d(n,, nm, - + +), where n, = 1 or 0 
according to whether @ appears in the set of indices a,, o%, °° °, 
a, or not; these quantum numbers n, may thus only assume 
one of two values. On replacing a, = « in (11.9) by an index 
B + a, (11.9) vanishes if B is equal to one of the remaining 
indices @,°°**, @,; if Bis different from a, +++, a, it becomes 


x{ Pay "8 8 Ang = + (ny, "8 ty My I, my Np ae 1, aed J, 
the sign -- 1 being (— 1)" where r is the number of indices in 
the set a, ° ++, a, lying between « and B: 


r= Din, 
A 


where the sum is extended over all indices A between « and B. 
We again obtain equations of the form (11.4); (11.5) 1s then 
valid as it stands but (11.5’) is to be replaced by 


} , 
Hv tljg Hay @ eM, tag? © 2) Se Lor O, 


where the first alternative applies to the case in which all 12’ = » 
except n,= 1, n, = 0; n,= 0, n,= 1, the sign being again 
determined in accordance with the above rule. On = writing 
a matrix |la(#")|| in the form 


a(0 0) a(0 1) | 

a(l 0) a(l 1)j 
and introducing the abbreviations 

1 0} || 1 0 

lope ® fo alee 


\ 


we may write 
_ 00) oe 
Naa 1K 1x x9 4 pdt x 


Nepes 1X1 XK [Po X<t a «x 9 tx So" GB). 


where the matrix that is written explicitly in the first equatior 
is in the «® place and those in the second in the «'® and pt 
places respectively. We must now attempt to write these 
matrices in the form xX, X,; this can in fact be accomplished by 
taking 


rem U Xt xs sx Hx |) [xd xd xs] 
Ae (LAL 
Femi xt xe xt xd g| x txtxs +s] 
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the small explicit matrices being in the «" place. x,, X, are 
Hermitian conjugates, and H can now be written in terms of 
them in the desired form (11.8). Instead of the commutation 
rules (11.7) we now have 


X,Xg + XgX,=0,X, Xp + Xp X. = 0, X, Xp 4- Xp X= Hy. (11.12) 


(11.1) is the irreducible solution of these equations by a pair of 
Hermitian conjugate matrices x,, X, which are so normalized 
that X, X, 18 a diagonal matrix. 

In order to show that the equations (11.4) for the vector 
in system-space yield the Hamiltonian equations (11.6) for the 
forms 


X, = Yx.(n;n') b(n) d(n’) and X,, 


n,n! 
we must prove that the formula 
oH 


x, H — Hx, = <= 
a 


employed in II, § 11, holds here as well. We find that it does 
not hold for an arbitrary polynomial H in x,, X,, but that it 
does for even polynomials in general and so in particular for 
the Hermitian form (11.8). For we have, for example, 


X, XX, = 8),Xg — X,Xy Xp = 84, X—p + X, Xp Xy, 
whence 
X,° X,Xp — X,Xp° Xy = 9yn Xp, XH — HX, = YH yg Xz. 
e e e . e ° e B 
On introducing real quantities, 1.e. Hermitian forms, pu, q. 


by 


1 o 1 : 
xX, = 5 (Ve ++ 1p), XX, = 9 (da oo 1px) 


and denoting the set p;, dz; Px, We; °° ° straight through by 
P1; P2, Ps, Ps, °° * We obtain the relations 
Pz =1, PaPp + PePa = 9 (a=) (11.13) 


The p, are not only Hermitian but unitary as well, as can be 
seen from the first of these equations or directly. Here again we 
meet the matrices 

0] 

1 0}? 


which occurred in connection with the spinning electron. 


1 0 


io 
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We have thus discovered the correct way to quantize the 
field equations defining electron waves and matter waves. 
Here again we find, as in the case of the spinning electron, that 
quantum kinematics 1s not to be restricted by the assumption 
of Heisenberg’s specialized commutation rules. 


§ 12. Quantization of the Maxwell-Dirac Field 
Equations ?! 


The field laws arise from a Hamiltonian principle which 1s 
analogous to the Hamiltonian principle of classical mechanics. 
This latter 1s expressed in terms of a Lagrangian function L 
which depends on the positional co-ordinates g,; and their de- 
rivatives q, with respect to time, and asserts that the first 
variation of 


\ Las, q,)at (12.1) 


vanishes when the q,; are assigned arbitrary infinitesimal incre- 
ments 6q, which vanish outside a certain finite time interval. 
This principal yields, on integration by parts, the differential 
equations 

OL OL 


dp; - _ _ 
di -++- io == Q with P: eh ae dy, ie = dq, (12.2) 


Defining 
jal = L -- a4 Pi: 
and noting that 
éL = aL; 84, — DP: 94; 
we obtain for the differential of 7 the expression 


6H = ob, 84; “a 24: OP. 


Expressing H as a function of the g, and the generalized momenta 
p; associated with them, we have 


oH ny 
da. = Li, Se 
qi oP i 
and by (12.2) these are just the Hamiltonian canonical equations 
dq, _ 0H dp, _ oH 
dt dp; dt dq, 


In quantum theory the q,, p,; are operators satisfying Heisen- 
berg’s commutation rules. 
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This reasoning can be carried over without difficulty to the 
case of a continuum, as appears in field theories. On replacing 
for the moment the 3-dimensional space by the 1-dimensional 
interval 0 S x <1 described by the co-ordinate x and assuming, 
for the sake of simplicity, that only one state function g = q(x, 2) 
is involved, the integral (12.1) is then to be replaced by 

i 


\{L, q) dx dt. 


0 
Naturally L may depend on the spatial derivative af or even 


higher derivatives, in addition to g. The continuous variable 
x takes the place of the index 7 and the Lagrangian function, in 
1 


the sense of (12.1), is now the integral \L(q, q)dx with respect to 


0 
the spatial variable instead of LF itself. We first replace the 


continuum by a discrete set of equidistant points defined by 
Ay eee - (i= 0,1,°+:°+,2— 1). The differential quotients with 


respect to x are naturally to be replaced by difference quotients 
with the difference Ax = 1/1, and the integrals become sums. 
In accordance with the outline above we must now set 


_ _ ola, 9d). 
Pi= 7 Ax, 


calculated at the point x =1/n. For the continuum we have 
analogously to set 


p= — 
and H is to be defined by 


1 
H=L+ \dpdx. 
0 


The commutation rules which are satisfied by g, p in quantum 
mechanics cause some trouble. As long as we employ the 
discrete set of points in place of the continuum they are 


g(x) pla’) — p(x’) q(x) = ¥ = es 


where x, x’ run independently through the set z/n and §,,) is 
1 or O according as x’ coincides with x or not. For fixed x’ 
l 


Ag’ ore! = Ole — #') 
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is a function of x which vanishes for all values of the argument 
other than x’ and is there so large that the sum 2'8(% — x’) + Ax 
x 


has the value 1. In dealing with the continuum we therefore 
introduce with Dirac a function 8(% — x’) which vanishes at 
all points x =+ x’ and ts so large at the point x’ that its integral 
has the value I (cf. I, § 7). Of course there exists no such 
function, but it can be “ arbitrarily closely approximated ” by 
a function which vanishes everywhere except in a very small 
interval about x’ and assumes very large values within this 
interval. Only in this sense can we perform the passage to 
the limit Av = 0 and write the commutation rules symbolically 
in the form 


q(x) P(x’) — p(x’) q(x) = 18(x — x”). (12.3) 


A good illustration of the mathematical interpretation of 
this pathological function 6(% — x’) arises in the theory of ortho- 
gonal sets of functions @¢,(x), for with its aid the completeness 
condition may be formulated 


X'b.(x Nee) i aX"), 


This is literally correct as long as x only runs through a discrete 
set of points, but the rigorous mathematical formulation for 
the case of a continuum is given by 


J 


lim (fs 1 $l x u(x) v(x’) dx dx’ = = fu(x) 


where u(x), v(x) are any two continuous functions in the interval 
(0, 1). Hence from the more rigorous standpoint (12.3) must 
be replaced by the equation 


J fie(x){q(x) p(x’) — p(x’) q(x) Jo(x") dx dx’ = i) u(x) v(x) dx 


containing two arbitrary functions u(x), v(x); furthermore, it 
is to be noted that the p, qg in the brackets are first to be replaced 
by approximations p™, q!)—e.g. by the n' partial sum of 
their expansion in terms of orthogonal functions—and_ the 
passage to the limit 2— oo is to take place after, not before, 
the integration. This interpretation offers a sound mathematical 
method of dealing with the relation (12.3). It 1s to be emphasized 
that (12.3) refers to two points of space x, x’ at the same moment 
t, i.e. in a section of the world in whicht = const. ; the arguments 
of g and p are to be written more precisely as (x, ¢), (x’, ¢) re: 
spectively. 
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On applying this general scheme to the action 
W=M+M'+-F, (5.18) 


from which the field equations for the electron and for the electro- 
magnetic field are obtained, we find ourselves faced with a 
difficulty arising from the fact that the Lagrangian function 
does not contain the time derivative of the scalar potential f,, 
for the generalized momentum associated with fy then vanishes 
identically and cannot possibly satisfy a commutation relation 
such as (12.3). We avoid this difficulty for the moment by 
utilizing the principle of gauge invariance to remove f, from the 
expression of the Lagrangian function by setting it equal to 0; 
this device has already been employed in II, § 13. The set of 
independent functions describing the state is then 


pb = (tb), Yo, Ps, 4), j = (fi, So, fs), 


where we have written wz, py, in place of ¥,, Yo. The momenta 
associated with these quantities are then found to be: 7, with 
%, and — FE, with f,. The commutation rules which are to be 
fai in quantizing the field equations are accordingly 


p,(P P’) + ,( P' yah(P ) = 8,¢°6(P— P’) [p,o =1,2,3,4], (12.4’) 
fol P es ') — Ey(P' oP) = 8 yq°3(P—P") (p,q=1,2,3], 12.4”) 


where P and P’ are any two points of the same spatial section 
t= const. We have here taken account of the fact that the 
quantities % describing matter are not to satisfy Heisenberg’s 
commutation rules, but are instead to satisfy those obtained 
by replacing the minus sign which occurs in them by a plus 
sign. These rules must be supplemented by the assertion that 
the Ws, satisfy in addition the equations 


P(P)po(P') + Pol P')p(P) = 0, (12.5) 


and the same for #,; that the f, at any two points P. P’ are 
commutative and the same for the &,; and finally that the 
material quantities %, % on the one hand and the electromagnetic 
quantities f,, E, on the other are kinematically independent, 
and that every quantity of the first kind at a point P commutes 
with every quantity of the second kind at any point P’ (in the 
same section t = const. of the world). 

As in II, § 13, we again consider the whole system enclosed 
in an insulated and perfectly reflecting cavity which is at rest. 
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In order to describe the electro-magnetic potentials we make 
use of a complete orthogonal set of solutions f of 


Ay -+ vf = 0 (12.6) 
in the cavity, which satisfy the conditions 


div f = J normal 


’ 


at the walls. The construction of such a system is readily 
obtained from the Gauss divergence theorem 


J(curl f+ curl g + div f- div g + f+ Ag)dV 
= (CF. curl g], +f, divg) do  (n denoting normal component) 


for the vector [f, curl q] + f div g, | and g being two arbitrary 
vector fields.2* We first determine the scalar functions ¢ = 4, 
which satisfy the equation Ad + A*é == 0 and vanish on the 
walls, and from them construct the vector fields f, = grad 4, ; 
these vectors f, automatically satisfy the conditions above, 
are of course mutually orthogonal and can be normalized in 
accordance with the equation 


f(fa- f)dV = 8yn| == 2 bade’ J. 


We also determine a complete normal orthogonal system f, of 
solutions of (12.6) which are normal to the walls but which 
satisfy the condition div f, = 0 everywhere, not only at the 
walls. The j, are then orthogonal to these jf, and they con- 
stitute together a complete orthogonal system for vector fields 
in the cavity. We may consequently write 


= Lay + APA Ia) 


12.7 
—€= pf. — Lah| ake 
in the section t=: const. The f,, f, are vectorial functions of 
position in space and have as values ordinary numbers, whereas 
the p, g are scalar quantum mechanical matrices which are 
independent of position and which satisfy the commutation 
rules 


qv Pv — Pv GW = 1, QaPpa— Pada = us 


all ¢g commute among themselves and all p among themselves, 
and any p commutes with any q whose index 1s not the same. 
[These rules are perhaps most readily obtained by solving 
(12.7) for the ‘‘ Fourier coefficients’ p, q in terms of integrals 
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of scalar products of f, © with f,, f, and applying the commuta- 
tion rules (12.4).] The energy 


3) («© i (car play 


ow 
of the electro-magnetic field becomes 


1 Se O35 
E5(ae, 7 — 92) + 32m 
We already know the solution of the commutation rules which 
reduces this expression for the energy to diagonal form. The 
individual components of the vector on which the p, q operate 
are distinguished by means of the quantum numbers N,, corre- 
sponding to the »v, and the values of the continuous variables q,, 
corresponding to the A. On setting q,= W«°Q,, Q, 1s an 
operator which affects only the index N, in accordance with 
the equations 


eae: ioe 
OAN,, N,— 1) = 4% O(N, Ny + 1) = fete, 


all other components, corresponding to transitions N,—> N, 
in which N, is neither V, + 1, vanish. N, assumes the integral 
values 0, 1, 2, * + + and can be considered as the number of 
photons of the kind vy. The momentum fp, associated with the 
continuous variable q, is, following Schrodinger, represented by 


a ee 
the operator oa The electro-magnetic energy 1s then in 
A 


diagonal form and, on neglecting the (infinite!) null-point 
energy, multiplies the vector component (N,; q,) with 


SvN, + it (12.8) 
v A 


We thus see how it happens that the electro-static part, which 
is described by the continuous variable g,, is separated off from 
the part due to the radiation, described by the discrete N, 
giving the number of photons of kind vp. 

The w appear in the part of the energy due to matter only 
in combinations of the form J, p,. Consequently it will be found 
advantageous in dealing with electrons to apply the method 
of composition followed by anti-symmetric reduction; we have 
shown 1n the preceding section that this procedure is equivalent 
to quantizing in accordance with the rules (12.4’). Since the 
electro-magnetic quantities commute with the ¢,, , they may 
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here be considered as ordinary numbers. The quantized wave 
equations then refer to a ‘‘ vector ’’ 3 with components 


rT ie — oF cae Vs qa), 

where P,, : * +, Py, are the positions of the electrons and 
Pi, ° °°, Pn are their spin variables, each of which runs through 
the four values 1, 2, 3, 4. We write 2, |, as a column 
consisting of 4" terms; this 2 1s anti-symmetric with respect 
to a permutation affecting the P, and p, alike. G™ = (S%”, 
SY, SY) is the spin vector (S,, S,, S3) operating only on the 
rb index p,, T‘") is similarly the operation on the +” index p, 
which interchanges %,, Ww with uy, 4, and grad is the gradient 
with respect to P,. The part of the Hermitian energy operator 
— F, in the equation 


ld 
ge, ~ Jab = 0 (%» == cl, H = — cF) 


which depends only on matter 1s 


: = i 
5 (S”, Eada (iP) 4 See ei o-) 
r=1 1 y | 1%) o7Fa 
my ET) (12.9) 
r=l1 


and to this must be added the electro-magnetic part (12.8). 
Since we have throughout taken the scalar potential fy = 0 
we have lost the equation 


div € + p=0 (12.10) 


arising from the variation of f,. This equation contains no 
derivatives with respect to time, and consequently represents 
a condition on the state of the field at a moment ¢ = const. ; 
we must naturally take it into account. On substituting the 
value of © from (12.7) we obtain 


21 Adi + p = 0 


and on multiplying with ¢, and integrating over the space under 
consideration 


qa — JedidV =0. 


From the standpoint of quantum mechanics the left-hand side 
of this equation is an operator D,, and the meaning of the 
equation D, = 0 is that only those vectors 4 which satisfy the 
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equation D,3 = 0 are to be allowed. JD, also consists of an 
electrical part gq, and a material part 


Jods dV = \ (qs by ao Bo tbo AF bs Ws + ws, ws), aV. 


The operator D, which is to be applied to 3 1s accordingly 
D,= Ua — 2'bx(Pr). 


The equations D,3 = 0 then assert that all components 
2(P,; N,; qa) of 4 vanish except those for which gq, = D/¢,(P,) ; 
r=1 


we may therefore write the non-vanishing components as 
WPrs Ne) = 21Pr5 Noi Sia(P,)) 


But then 
02 


grad fb = grad z+ DY grad f,(P,) ° 7 
2 1 


is exactly the combination which appears in (12.9). 29% is 
now given by s 

BE pa(P,) dalPs) = EGP, Ps 
where 


G(P, Pi) = LbalP) bal”) 


is the ordinary Green’s function for the cavity. We conse- 
quently obtain the quantum equation 


for %, in which the operator 
1) 


a 3176", grad(”)) + mT} se 


= 


+ SWN, + Vas ES", £(P,)).Q,} (12.11) 


vv r= 


In Dirac’s theory 


as grad) + mI 
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is the energy operator for a single free particle. a«G(P, P’) is 
the classical potential due to the electro-static repulsion be- 
tween two electrons situated at P and P’. The next term 
represents the sum of the energies v of the photons in the various 
frequency states v, and finally the last term represents the 
interaction between photons and electrons by emission and 
absorption. The meaning of cach of the terms from which 
the energy operator (12.11) is constructed is thus apparent. 
The quantum theory had previously dealt with fields, such as 
that which binds the electron in hydrogen to the nucleus, in 
a manner entirely different from that with which it treated the 
field of the emitted radiation ; the first was calculated classically 
and purely electro-statically as an action at a distance described 
by the Coulomb potential, whereas the second was broken up 
into discrete photons with the aid of Bohr’s frequency condition. 
We have now obtained a theorctical justification for this pro- 
eedure which led to good agreement with experiment. 

Our expression shares with classical electro-dynamics the 
disadvantage that it contains the term G(P,, P,) representing 
the infinitely large reaction of the 7 electron with itself, for 
as we allow P’ to approach P, G(P, P’) becomes infinite like the 
reciprocal of the distance PP’. We should therefore replace 
G(P, P) by the finite 7 (P, P) where 


P(P, P’) = G(P, P’) ~ a, 
da- PP’ 
for this amounts to dropping an infinitely large additive con- 
stant from ¥ 9. I(P, P) represents the effect on an electron at 
P of the field obtained by reflecting the field of P in the walls 
of the cavity. (12.11) shows explicitly how the various terms 
of Fy depend on the value of the fine-structure constant «; on 
developing the solution in powers of a we are faced again and 
again with infinitely large terms of the same kind as G(P,, P,). 
The operator ¥, contains singularitics which, at the present 
stage, frustrate all attempts to carry through the theory. We 
may indeed conclude with 2. Fordan that the problem of the 
existence of the electron is solved, but that that of its con- 
Stitution has as yet eluded us. Our equations further suffer 
from the fundamental disadvantage of the Dirac theory that 
the individual spin variables p, assume 4 instead of 2 different 
values. 

There is, of course, nothing to prevent us from quantizing 
the matter waves in a manner analogous to that applied to 
electro-magnetic waves. We should then develop our quantities 
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describing the material field in a series of characteristic 
functions # = yw) (with four components) of the Dirac equation 


|; (©, grad) + mol a + ppb = 0 (12.12) 


which constitute, on imposing appropriate boundary conditions, 
a complete orthogonal system. The general component 2 of 
the vector 3, on which the energy — ¢¥y operates, will then depend 
on the quantum number 2,, which corresponds to the char- 
acteristic values w and which may assume only the values 0 and 
1, and in addition on the numbers JN, of photons of the various 
frequencies v and on the continuous variables q,. But then the 
operators D,, which commute among themselves and with F,, 
are not in diagonal form, and the climination of gq, cannot be 
accomplished as in the above method. 

Instead of introducing a cavity as in the above we may 
employ a rectangular parallclepipedon with the ‘ boundary 
condition ” that all functions are to be periodic functions whose 
periods are the lengths of the sides of the parallelepipedon. 
We can then introduce running instead of standing waves as 
characteristic functions for the clectro-magnetic field; this gives 
rise to a better agreement with the physical picture in which 
a photon corresponds to a homogeneous plane wave. The 
energy and the momenta are then also in diagonal form if we 
neglect the interaction between matter and light. Equation 
(12.10) then causes some difficulty, as its right-hand side 0 
must be replaced by the constant mean value of the charge 
throughout the entire space in order that a periodic solution 
be possible. On taking account of protons in the theory this 
will automatically correct itself, as the total charge will then 
be 0. 

The dynamical law allows only those quantum jumps of the 
particles in which one 2, falls from 1 to 0 and another 7, jumps 
at the same time from 0 to 1. Consequently the total number 
of particles 2g, and therefore the charge, remains fixed ; hence 


that portion of the dynamical laws in which the total number 
is a given finite n is separated off from the remaining portion 
and intercombinations between the two do not arise. Dirac 
has proposed to interpret the presence or the absence of a proton 
in the state of positive energy p as the absence or the presence, 
respectively, of an electron in the corresponding negative energy 
state — w; our laws will then include protons as well as electrons.** 
Remembering that the numbers n, = 0, 1 were at first intro- 
duced mercly as an arbitrary index indicating the rows of a 
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matrix, there is nothing to prevent us from replacing the numbers 
n_, for negative —p by nj =1—n_,, keeping nt =n, for 
positive p. The theorem of the conservation of charge is then 


Sani — Sinz = const. (uw > 0). 


But we thereby alter the content, as well as the notation, of 
the theory ; we are now interested in that part of the dynamical 
equations in which only a finite number of 2, with positive pu 
are different from 0 and only a finite number of n, with negative 
pe are different from 1! The quantum jump of an electron 
between positive and negative energy levels, which was so un- 
desirable in the Dirac theory as formulated in the previous 
section, now appears as a process in which an electron and a 
proton are simultaneously destroyed and as the inverse process. 
The assumption of such an occurrence, for which our terrestrial 
experiments offer no justification, has long been entertained in 
atrophysics, as it seems otherwise extremely difficult to explain 
the source of the energy emitted by stars. 

However attractive this idea may seem at first, it 1s certainly 
impossible to hold without introducing other profound modi- 
fications to square our theory with the observed facts. Indeed, 
according to it the mass of a proton should be the same as the 
mass of an electron; furthermore, no matter how the action 
is chosen (so long as it is invariant under interchange of right 
and left), this hypothesis leads to the essential equivalence of 
positive and negative electricity under all circumstances—even 
on taking the interaction between matter and radiation rigor- 
ously into account. 

Hlaving now quantized the field equations, we must return 
to the question of how the constituents MM, M’, F of the action 
behave under the substitutions (6.12), (6.13), (6.14). The first 
two substitutions, which we may call (a) and (b), have exactly 
the same effect as before. But the third substitution (c), 
which sends the components of % over into the components 
of &% or their negative, now affects M and M’ differently, for 
y~ and # are no longer commutative with respect to multiplica- 
tion-—they are, in fact, almost anti-commutative. From this 
it is found that M, M’, F behave under (c) in exactly the same 
way as they do under (0), 1.c. they are multiplied by the signs 
— 4 respectively. Hence past and future play essentially 
dipeen roles in the quantized field equations ; we find no sub- 
stitution which leaves these equations unchanged while reversing 
the direction of time. It seems to me that we have thereby 
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reached an extraordinarily important goal of physics. We 
can now obtain the substitution 


fe > —fa (a = 0, 1, 2, 3) \ 
py > on Ba, be > Ps, b> ws, by-> — py 


on combining (a), (b) and (c); this substitution neither affects 
the co-ordinates nor disturbs the quantized wave equations. 
In view of Dirac’s theory of the proton this means that positive 
and negative electricity have essentially the same properties 
in the sense that the laws governing them are invariant under 
a certain substitution which interchanges the quantum numbers 
of the electrons with those of the protons. The dissimilarity 
of the two kinds of electricity thus seems to hide a secret of 
Nature which lies yet deeper than the dissimilarity of past and 
future. 


§ 13. The Energy and Momentum Laws of Quantum 
Physics. Relativistic Invariance 


In quantizing the wave equations the spatial and temporal 
variables were treated so differently that the relativistic in- 
variance of the resulting laws might seem to be open to serious 
doubt.. But a thorough investigation due to Hetsenberg and 
Pault reassures us on this point.2® We carry through these 
considerations on our action principle—but in such a way that 
the general validity of the argument may be readily seen. At 
the same time this offers an opportunity to discuss the meaning 
of the quantization more thoroughly than we have done hitherto. 


I, The Energy and Momentum Laws of Quantum Physics. 


We begin with the 4+ 3+ 3 operators x, f,, E, which 
are functions in 3-dimensional space satisfying the commutation 
rules (12.4) and the supplementary rules there set forth. There 
exists one, and in the sense of equivalence only one, irreducible 
solution of these conditions. From it we obtain the energy 
density ¢) defined by (6.5), (6.6) and integrate it over all of 
space : 


Fo = Jay. (13.1) 


We next construct the ‘‘ commutator ”’ 


5D = [Fo, P] = (FH — #F,) 
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of an arbitrary operator ® with ¥). Consider the result of this 
for the particular operators ® = y,, f,, E,; it should be possible 
to evaluate these commutators using (12.4) and the supplement- 
ary rules alone ;. if one of the quantities involved appears as a 
derivative with respect to a spatial co-ordinate it should be 
transformed by integrating (13.1) by parts—or by deducing 
commutation rules for it from (12.4) in terms of appropriately 


. Or, 2 
defined derivates of the 6 function. If ~~ that process 
0 
involving only differentiations with respect to the spatial vari- 
ables, but which coincide with the derivative with respect to 
time in virtue of the Maxwell-Dirac field equations, we find 


op, = i 7 if) Yo ofp = Oe oI ae Gis 


IX Ox, 


Ps 9 

Ol) = a (13.2) 

We now drop the normalization fy = 0. It follows from these 

equations that 6@® for any gauge invariant operator ® coincides 

with its time derivative as defined in terms of its spatial deriv- 

atives by means of the field laws. We may therefore replace 

the Maxwell-Dirae field equations by the quantum mechanical 
dynamical law 


i Oe (13.3) 
0 


4 represents the probability state of the physical system (pure 
state!) at the time x); 1t 1s a vector of that veetor-space in which 
our operations take place. The fundamental concepts here 
involved are contained in the general programme of quantum 
mechanics as set forth in II, § 7. The ‘ density of electricity 
at the point P'” is, tor example, represented by the operator 
p == db, + + + which is independent of time. The changes 
in the probability distribution for this physical quantity in 
course of time are due to the changes in the state 3 and not to 
changes tn p itself; the rule for the calculation of this probability 
distribution from p and 4 is given in the general programme 
referred to above. The same remarks apply to any gauge 
invariant quantity ®. Jlowever, it is more desirable to con- 
sider the ‘‘ density of electricity’ (without specifying either 
time or position) as a fixed physical quantity represented by a 
definite operator p, and to ascribe the variations in its prob- 
ability distribution in time and space to changes in the prob- 
ability state 3 considered as a function of the spatial co-ordinates 
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X41, %q, X3 tn addition to the time x». We should then expect to 
find four equations 


1 03 
oe Fu% (a =O, 1, 2, 3) (13.4) 
in place of the one (13.3) in which the operators 
¥, == [2 aV 


are those representing energy and momentum. Only now that 
we have formulated the general scheme of quantum physics 
in a manner which is symmetric with respect to the spatial 
and temporal co-ordinates, as required by the theory of relativity, 
can we consider it as complete. In order to determine the 
mean value of a quantity such as the electric density p we must 
assign to the spatial co-ordinates %,, %), x3, on which the operator 
p depends, any definite values x} (e.g. 0). The spatial com- 
ponents of equation (13.4) tell us that the replacement of (x9) 
by a neighbouring point (x? + dx,) amounts to the same thing 
as subjecting the normal co-ordinate system in system space, 
to which the vectors 3 are referred, to the infinitesimal rotation 


1(F, dx, + Fadx_ + Fz x3). 


We must not forget that the equation (13.3) is not equivalent 
to the complete set of field equations, for we have omitted the 
one 


o(P) = div€E+p=0 


which does not involve differentiation with respect to time. We 
must therefore restrict ourselves to vectors 3 which satisfy all 
the equations 

a(P), = 0. (13.5) 


These equations define a linear sub-space ®, of the original 
system-space t. The operators o(P), o(P’) associated with any 
two points P, P’ of space are commutative : 


o(P) o( P’) — of P’) o( P) = 0. 
It is of prime importance that o(P) commute with Fo, 1.e. that 


dag = eg ez Gj g) = 0 


1 

that this is the case follows from the fact that the equation 

oe = 0 is a consequence of the remaining field equations in 
0 

the classical field theory, and consequently—independently of 
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our field equations—we may conclude that the gauge invariant 
operator o satishes the equation do = 0. This commutativity 
of o(P) and F, guarantees that the infinitesimal rotation 1¥ydx, 
of system-space during the time interval dx) does not carry the 
vector 3 iying in the sub-space St, out of R,. 

Continuing our programme, we now set 


= dV 
and investigate the ‘‘ commutator ”’ 
dD ie Rar ®) 


of an operator ® with ¥,; we shall denote this commutator by 
6, whenever confusion might arise between it and the commutator 
6 = 6, with ¥). We find the equations * 


By = (52. + goes ify = Hy = 2 2h.) 


ox, Wy 
; VE E VE 
Of, peer ( : a > 4 p) —— os — Od, dF, — *, oer | 
OX = OX, ox 


(13.6) 
From this it follows that for any gauge invariant quantity ® 


ID 7 
we have 6@ = ORAL taking the equation o = 0 into account. 
1 


H{[ence the way in which gauge invariant quantities depend on 
the spatial co-ordinates can in fact be described as we predicted : 
the operators representing them are constant, but the vector 
4 representing the probability state varies in space in accordance 
with the equations (13.4) for « = 1, 2, 3. 

That the four equations (13.4) are consistent also follows 
from these considerations. In the first place we have 


6,0=0 or o(P)F, — F,0(P) = 0 
in the entire space R; this follows from (13.6). In the classical 
field theory the differential conservation theorem 
ae a =) i 
OX OX, WX Xs 
is a consequence of the field equations. Since {] is a gauge 


invariant, it follows that after the quantization the operators 
satisfy the relation 


dt dt? ny 
5,,° & see nt) = 0 
off + oe va V4 a, 


* In contrast with (6.2) we now employ the letter 5, without the factor 
1/x, as an abbreviation for curt f. 
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in the space &, defined by (13.5). Integrating over the space 
X» = const. we obtain 


Sy) dV = 0 or $F, — F1 Fo = 0. (13.7) 
[The equation which takes the place of (13.7) for the entire 
space §t is 
FoF: — Fi Fo = joF dV. 
Furthermore, 
yb 


me, | St O. {? 
iy 2t3 


in R,, and on integrating this over space we find 
8, fa — () Or F ots Pe Fass == (). 


We thus see that the operators ¥, are commutative in R,, and 
consequently equations (13.4) possess one and only one solution 
3 when the initial value of 4 (i.e. at the origin of the space-time 
co-ordinate system) is a given vector in ®,. 


Il. Relativistic Invariance. 


On transforming from the normal co-ordinate system x, in 
space-time to another x, by means of a Lorentz transformation 


3 
A : X x = 2 Pap XB 
—0 


the solution of the equations 


— 


eee aig (13.4’) 


10K, 
is, as we shall show, obtained from the solution of (13.4) by 
means of a unitary transformation U induced in system-space 


by A. That is, there exists a unitary transformation U such 
that 


ieee a, aoe 
-d(U3) = (SFadxs)(US) 
is satisfied in virtue of (13.4) : 
Oe SF ah es Fa 
ax « B 


or 


US, = 2 Opa Ip” U. (13.8) 
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We could also say that (13.4’) have the same solution 3 as (13.4) 
but that the normal co-ordinate system employed in system- 
space has undergone the unitary rotation U, for the vector U3 
has the same components with respect to the new co-ordinate 
system as § had with respect to the old. We are only able to 
give the transformation U explicitly for infinitesimal A : 


1 
lonpl| = 1+ [orgs U=1+ 58M, 


The equations (13.8) which are to be verified are then 
aS 90 Ba = (oM, Talk 


In particular, the operators in system-space which correspond 
to infinitesimal rotations in physical space are, as we have 
long known, those representing moment of momentum; that 
6M corresponding to the infinitesimal rotation D, ° 


Oo 0, OF p= 0) Oi We, 0X72 4 (13.9) 
about the x,-axis is the x;-component of moment of momentum: 


(M, == ) May = | (xrol8 — xg(Q)aV. (13.10) 


The infinitesimal Lorentz transformations which actually repre- 
sent a re-partitioning of the world into a new space and a new 
time are dealt with in exactly the same manner; it will suffice 
to consider as typical of such transformations 

dX = M1) bX, = Xo; OX = 0, OX3 =a (0. 


The 6M associated with this transformation ts 
My = fx, dV + Saat av; 


the second term, which vanishes for x) = 0, can be omitted, 
for we have already shown that ¥, commutes with all ¥,. This 
term does not fit into the present scheme, in which all the 
operators are functions of %,, ¥%2, ¥; alone. Our problem is thus 
reduced to showing that in R, 


[M 93, Fa| ao 0, 0, Lee as ge —_ (13.11) 
[Myo, Fa] = ears eee 0, 0 a o 2, es (13.12) 


Furthermore, the invariance of equations (13.5) which define 
the sub-space ,, will be proved by showing that the equations 


[Mo3, 0] = 0, [Myo, o] = 0 (13.13) 
hold in the entire space ®t. 
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In order to prove (13.11) we make use of the identities 
0 , 0 
[2a — lay —~0 [a=1, 2,3). 
Xx 


Introducing the Kronecker 6,,, the integrand may be written 


dtg dtp 
(8.2 t§ — Sys fe) + (4 — %3 =), 
X 


In consequence of o = 0 and since t = 2), ¢3 are gauge invariants 
the operations 


— may be replaced by 3,f = (Fa 4, 


whence 
(8.2 73 — 8x3 Fa) + 8, (x2 (3 — X3te)dV = 0 
or 
OM, = (Fa, Mos] == 8a3 Fo — Onn Fy = [ae = 1, 2, 3]. 
In the classical field theory the conservation law 


y 0(%_t3 — Xt) 
a = 0 OX» 


== 0 


is a consequence of the field equations, whence on quantizing 


Seas a) 2G 


a=1 OX, 


holds identically in ®,. Integrating over the whole of physical 
space we obtain 


89M oe, = [Fo, Mas] = 0; 
equations (13.11), 1e. 


[ Mos, Ful = 829 F3 — Oa3 Fo [a == 0, I, 2, 3], 


are thus completely verified. 
The relations (13.12) are obtained in an analogous manner 
from 


ps to) 


ae. dV =0 [for « = 1, 2, 3] 


and from the equation 


{So(xs #8) + QdV = 0 
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which parallels the conservation theorem 


= fo -+ to tt) ay =, 

dX 
of the classical] field theory. 

We should expect the operator functions expressed by the 
b,, fp, Ep, depending on the spatial co-ordinates, to be in- 
variant if we associate with an infinitesimal rotation of the 
spatial co-ordinate system an appropriate linear transformation 
of the components ~, among themselves and of the vector 
components f,, &,, and at the same time subject the normal 
co-ordinate system in system space , to the corresponding 
unitary transformation. In formule: We expect the process 


6D = [M25, P} 
to yield the equations 


| nee 
of, = Of» a (S50 fs ae 803 fo), 
OE, oe Of, a (S52 Ey al Ons E,), 


where we have written 


ID IP 
dee a ey 


But we find by direct calculation that 
; ; ee 
Os = O' + 1(X2 fg —- X5.fo)Ys — 5 
Of, = 2H, + %5Hy, of, = — %_ Hy, 8fp = — x3 Ay, 
bby = OE, + by. E3 + X30) — 83(E_ + X29). 
We first observe that these equations yield 


independently of the condition o=0. On introducing the 
condition o = 0 we find from these equations that gauge 1n- 
variant quantities ® exhibit the expected behaviour. The 


second of the equations (13.13) can be obtained by an analogous 
computation. 
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D. Quantum KINEMATICS 


§ 14. Quantum Kinematics as an Abelian Group of 
Rotations 


If we consider the operators 2p, 1g as infinitesimal unitary 
rotations of the ray field in system space, then Heisenberg’s 
commutation rules [II, (11.4)] assert that these rotations are 
commutative ; consequently they generate a 2f-parameter 
Abelian group, where f is the number of degrees of freedom. 
Let us therefore investigate the properties of Abelian groups 
of unitary rotations in the ray field of n-dimensional space! 
On introducing a gauge as in III, § 16, to each such “ rotation ”’ 
there corresponds a transformation of vector space with matrix 


A and between any two matrices A, B there exists an equation 
of the form 


AB = eBA. (14.1) 


This equation is possible only if ¢ is an x root of unity, for on 
evaluating the determinant of both sides we obtain e* = 1. 
From (14.1) we obtain by mathematical induction 


A®B =: e® BA* 
AB! -= BA, } oo 
fork, l= 1, 2, 3,-+:+. On combining these two equations by 


applying the second to A* and B instead of A and B we find 
the general rule 


AtB! = el Bl Ak, (14.3) 


Taking k = 1 in (14.2) we are led to the equation A"B = BA”; 
if the Abelian rotation group 1s irreducible Schur’s fundamental 
lemma allows us to conclude that since A" commutes with all 
clements B of the group it must be a multiple of the unit matrix : 
A" ™~ 1. The order of any element of an irreducible Abelian 
rotation group in n dimensions 1s consequently a factor of n. 

An f-parameter continuous rotation group is generated by 


an f-dimensional linear family g of infinitesimal unitary corre- 
spondences 


o,Cy ta,Cgot-*++a,Cy (14.4) 


in terms of a basis formed by any f independent elements 
Cy, Co, + + +, Cy, of the family. The numerical parameters 
31, %, °° *, Gy may assume all real values. Setting o, = a; dr 
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and reiterating the infinitesimal transformation (14.4), we find 
that at ‘ time ’’ 7 the resulting transformation is 


U(o,, To, °°, 0,) _— ems talatiers + ofly (14.5) 


where we have replaced a,;7 by o;. U runs through the entire 
group, which is now expressed in terms of the parameters oa. 
If the group of unitary transformations of the vector space is 
Abelian the C, must satisfy the conditions 

CiCp C0, = 0. (14.6) 


From this it then follows that all the elements (14.5) of the 
group are mutually commutative, for if 4B — BA = 0 we have, 
as in the domain of ordinary numbers, 


eB —. pAtR 


The parameters o in (14.5) are added on composition : 
Uo, +++, o)U(oy, +++, of) = Uloy +04, + + +, 0 + of). 
[f, however, only the rotations of the ray space are commutat- 
ive, we find in place of (14.6) conditions of the form 
C16. = 56,6. 4cua. 


where the c,, constitute an anti-symmetric system of real numbers. 
The commutator of the infinitesimal transformations with 
matrices 
A=o,C,+:::+oa,C, Berney tes: 4+7,€; 
is 
AB — BA = 1)'c,,0,7,° 1. 
MV 
We shall refer to the anti-symmetric form 
DOr = 16.7) 
iM, Vv 
as the commutator form ; it is invariant under change of basis. 
On writing 1 + 4 1 + : in (14.3) in place of A, B and allowing 
k=l=m- oo, we find that the commutator of any two 
elements U(o,, 09, °° *, oy) = U(e) and U(r) of the group ts 
U(a)U(r)U-*(a) U(r) = e[h(o, 7)} + 1. (14.7) 


If the rotation group is irreducible a fixed U(a) can only 
commute with all U(r) if it is a multiple of the unit matrix, 
i.e. if all its parameters o vanish. From this we conclude that 
the commutator form is non-degenerate, 1.e. that it cannot 
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vanish identically in 7; for a fixed set of values o,, unless all 
o, = 0—this amounts to the same as the condition |c,,) + 0. 
Such a form exists only if the number f of variables is even, in 
which case it can, by appropriate choice of the basis (i.e. by 
transforming the variables o, and 7, cogrediently under an 
appropriate transformation), be reduced to the canonical form 
in which the matrix ||c,,|| is decomposed into 2-rowed sub-matrices 


0 1 
—l 0 


arranged along the principal diagonal.* It is then desirable to 
write 2f in place of f and to denote the ‘‘ canonical basis’’ so 
obtained by 


1h, 80, (ess, 2, Om 
and the corresponding parameters by o,, 7,. The factor 7 has 
been introduced in order to express the results in terms of 
Hermitian operators P,, Q,. The basic elements then satisfy 
the commutation rules 
(P,Q, =< Q,P,) = 1, (PQ, — OP) = 0 
for w + v and 
PP fl 9, 0,2, — 2,20, = 9 
for all p, vy» The elements 


U(c) = e(o,P, + o2P, +++ ++ a,P,)_ [e(x) = et? 


then constitute an f-parameter Abelian group of unitary (vector) 
correspondences, as do also the 


V(r) = e(7,0, + 7202 +° + + + 7,Q,). 


But the commutator of elements U(c), V(r) belonging to these 
two sets, respectively, ts 


Ula)V(7)UMo) V(r) = eoyry + 8 + yrs) h 
We have now carried our development to a point where we 
can profitably return to the considerations of II, § 11. In 
the case of a system with one degree of freedom in classical 
mechanics any physical quantity associated with the system 
is expressed mathematically as a function f(p, q) of the canonical 
variables p, gq. In making the transition to quantum mechanics 


we had previously restricted ourselves to polynomials in p, q. 
But the Fourier representation 


4-00 
(p, 4) = J felop + 79) &(o, 7) dodr (14.8) 


* See Appendix 3. 
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of a function fis applicable to a much larger class of functions : 
this integral need not be interpreted literally, the essential 
point being that it represents a linear combination of the simple 
functions e(ap + 7q). On considering 1p, tq as infinitesimal 
unitary correspondences in ray space which are commutative 
in accordance with the relation 


(pq — 9p) = 1, (14.9) 
e(op + 7q) runs through the group generated by them. If we 
now consider €(¢, 7) as the components of an element in the 
resulting group algebra, then (14.8) is its group matrix in the 
representation obtained by associating with (o, 7) the unitary 
transformation e(op + 7q). This group matrix is Hermitian if 
the element is real, 1.e. if 


E(o, tT) = €(— o, — 7). 
A quantity f is consequently carried over from classical to 
quantum mechanics in accordance with the rule: replace p and 
qin the Fourier development (14.8) of f by the Hermitian operators 
representing them im quantum mechanics. In particular, the 
derivatives of f are represented by 
+ 


‘ 
fy = if Se(op + 7q) + 0 &(o, 7) dodr, 
00 
{ic i| {e(op + 7q) +7 &(o, 7) doar. 
= 
On letting U(r) in (14.7) again in infinitesimal we find, with 
the aid of the commutation rules (14.9), that 
Pp elop + 7q) — e(op 4 tq) * p = 7+ elop 4- 7q), 
q:elop + 7q) — e(op + 7q) + q == —a- e(op + 79). 
We therefore have in general 
fp=q'f—f.q, —if~=p-f—f-p 
as required in order that the Hamiltonian equations 
dq dp 
one ap te 
be equivalent to the quantum-theoretical equations of motion 
for the vectors of system space 
We have thus found a very natural interpretation of quantum 
kinematics as described by the commutation rules, The kine- 
matical structure of a physical system ts expressed by an irreducible 
Abelian group of unitary ray rotations in system space. The real 
elements of the algebra of this group are the physical quantities of 
the system ; the representation of the abstract group by rotations 
of system space associates with each such quantity a definite 
Hermitian form which “represents” it. If the group iS con- 
tinuous this procedure automatically leads to Heisenberg’s 
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formulation ; in particular, we have seen how the pairs of 
canonical variables then result from the requirement of irre- 
ducibility, whence the number of parameters in such an irre- 
ducible Abelian group must be even.*6 

If one of the canonical co-ordinates, say q, is a cyclical 
co-ordinate with period 27, then all quantities of the physical 
system are represented by periodic functions with period 27. 
Consequently the only values assumed by the parameter 7 
associated with g in (14.8) are multiples of 27 and the integral 
is to be replaced by a sum. In such a case we are no longer 
dealing with a continuous group, but with a mixed (continuous- 
discrete) group. 

Our general principle allows for the possibility that the 
Abelian rotation group is entirely discontinuous, or that it 
may even be a finite group. Thus we have discussed in III, 
§ 16, a group of order 4 and an irreducible ray representation 
% of it in 2 dimensions. That such groups actually occur in 
Nature is shown by the fact that the group we have just men- 
tioned characterizes the kinematics of the electron spin dis- 
cussed in § 4. It can be readily shown that % is the only 
irreducible representation of this group, and that it is in fact 
the only irreducible 2-dimensional group of unitary rotations in 
ray space. These results emphasize the remarkable nature of 
this simplest case. The quantization of the problem of several 
electrons discussed in § 11 also falls within our general scheme. 
In dealing with it we are interests in that Abelian group whose 


basic elements p, (a = 1, 2, , 2f) are all of order 2; such 
a group consists of the totality sf the 4f different clements 
PIPE °° * Poy (Na = Lor 0). 


The gauge can be so chosen that the corresponding unitary 
matrices p, in the irreducible ray representation in 2/ dimensions 
satisfy the equations 


p2=1, PsP. = — PuPp («+ 8B). (14.10) 


The kinematics of the spinning clectron is described by the 
simplest case f = 1 of this representation, 

Because of these results I feel certain that the general scheme 
of quantum kinematics formulated above is correct. But the 
field of discrete groups offers many possibilities which we have 
not as yet been able to realize in Nature; perhaps these holes 
will be filled by applications to nuclear physics. However, it 
seems more probable that the scheme of quantum kinematics 
will share the fate of the general scheme of quantum mechanics : 
to be submerged in the concrete physical laws of the only existing 
physical structure, the actual world. 
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§ 15. Derivation of the Wave Equation from the 
Commutation Rules 


We now show by actual construction that there exists but 
one irreducible ray representation (excluding the identity) of 
a 2-parameter continuous Abelian group: namely, that one 
which leads to the wave equation. 

We obtain our 2-parameter continuous group as the limiting 
case of a finite group with 2 basic elements ; our proof Is rigorous 
only insofar as the validity of this limiting process is admitted. 
Let A, B be two commutative rotations of an n-dimensional 
unitary space. On introducing the gauge we have an equation 
between their matrices: 


AB = &BA, (14.1) 


in which, as we know already, ¢€ is an n'" root of unity. The 
system consisting of the two matrices A, B shall be irreducible. 
Let their commutator, the number ¢, be a primitive m" root of 
unity, i.e. €" 1s the lowest power of € which is equal to 1; m is 
then a factor of ». The orders of the rotations A, B are also 
factors of 2: A*™~1, B*—~ 1, so the gauge may be chosen in 
such a way that A"= 1, B? = 1. Let B be reduced to diagonal 
ferm by an appropriate choice of our normal co-ordinate system ; 
the elements b, in the main diagonal are then all 2" roots of 
unity. Equation (14.1) then yields the following conditions on 
the elements of A =: |la;,|| : 


ot on £054. (15.1) 

We divide the indices 2 and the corresponding variables x; 
into classes in accordance with the rule that 7 and k belong to 
the same class if the quotient b,/b, is an m‘" root of unity, Le. 
a power of «. That this process really results in such a division 
into classes is shown by the fact that if ,/b, and b,/b, are powers 
of e, then b,/b, is also. By (15.1) a, == 0 if ¢ and & belong to 
different classes; hence the matrix A is reduced in accordance 
with the division of the indices into classes. But in view of 
the assumption that the system A, B was irreducible there can 
therefore exist but one such class. 

Having established this result, we now proceed to a finer 
division into classes: 1 and & shall now be considered as belonging 
to the same class if 0; = b,. We arbitrarily choose as the first 
of these classes that one for which 0; = 6 and let the second 
consist of those for which b, = € 6, the third with b, = e7b, - > +, 
the m'® with b, = e""1b; this exhausts the set, for the (m + 1) 
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class b; = e™b coincides with the first. Let the variables be 
arranged and numbered inthis order. It then follows from equa- 
tion (15.1) that all sub-matrices (7, k) of the matrix A are empty, 
i.e. @;, = 0, unless their row index 2 and their column index 
k belong to successive classes. The matrix A then has the 
form indicated in Fig. 3, in which all clements in the non- 
shaded portions are zero (and we have taken m= 4). The 
shaded portions are occupied by the sub-matrices A), A®), 
-+ + Alm), Since A is unitary the sum of the squares of the 
absolute values of the elements of a row or column is 1; the 


same must therefore also hold for the rows and columns of 
each of the sub-matrices. The sum of the absolute values of 
the squares of all elements in A® must then be equal, on the 
one hand, to the number of rows and, on the other, to the number 
of columns; the rectangle A) is consequently a square, and 
the number of indices in the second class is equal to the number 
in the first class, say d. By the same argument we sce that 
the number of individuals in each of the m classes 1s d, and hence 
n == md. The figure is to be corrected accordingly ; each of 
the shaded matrices is now unitary. On subjecting the variables 
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of the first class to the unitary transformation with matrix 
A®)* the sub-matrix A@ is reduced to the d-dimensional unit 
matrix. This normal form is undisturbed by a unitary trans- 
formation affecting the variables of the first set and the variables 
of the second set alike; we can therefore reduce the second 
sub-matrix to a multiple of the d-dimensional unit matrix, and 
so on through the (m — 1)**. The normal form so obtained is 
unchanged on subjecting the variables of each class to the same 
d-dimensional unitary transformation ; we may therefore choose 
as this last transformation one which reduces A™) to diagonal 
form. But the matrix A is then decomposed into d-sub-matrices, 
as can be seen by renumbering the variables, taking first the 
first members 1n each set, then the second, etc. The irreduci- 
bility assumption then tells us that there can be but one member 
in each set: d= 1,n =m. Our matrices are now in the normal 
form: 


0 1 ler | 
0] 

A= 01 = ea ' 
a000°--:90 | gent ro | 


ert l 


all elements not explicitly indicated are zero. The exponents 
in B are n successive integers and € is a primitive n™ root of 
unity. Finally, the equation A” = 1 yields a = 1. We number 
the variables from r on and take indices which are congruent 
mod. 2 as equal; the two correspondences are then 


, , i. 
On reiteration we find 
td 4 
AP? Vic ay, De Ape eee (15.2) 


The transition to continuous groups is now accomplished by 
passing to the limit 2—> oo. Let the basis 7P, 7Q of the con- 
tinuous 2-parameter Abclian rotation group be normalized in 
accordance with (14.9). We identify the matrix A of the above 
considerations with the infinitesimal e(€P) and B with e(nQ) 
where € and 7» are real infinitesimal constants. Then e(¢P) = 
A*, e(7Q) = B' when in the limit s€-> 0, M-7. € 1s now 
e(En) and é*! ==: e(Ekr). e(7Q) represents the physical quantity 
e™?; the values which it may assume are given by e’,) where 
71s real and & runs through all integral values. In other words, 
the quantity g may assume the values kR&; q may assume all 
real numbers from — co to -+ co. (Of course k ts to be con- 
sidered mod.n and k€ mod.n€, but v€ is a multiple of 27/y 
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and may consequently be infinite in the limit.) We therefore 
write g in place of k€, where q is understood to be a variable 
which runs through the possible values of the physical quantity q, 
and Vé-y(q) in place of x,. (q) is an arbitrary function, 
whose values are complex numbers, which satisfies the normalizing 
condition 


{ |W(q) Pag = 1. 


On passing to the limit in the second equation of (15.2) we 
find that the quantity e'77 is represented by the linear operator 


(q) > e'"2 + (q). 


Similarly we find from the first equation of (15.2) that 


¥(q) > ¥(q + 9) 


is the operator representing e'??. On returning from finite to 
infinitesimal unitary transformations we find 


q = 8$(q) = 49° $(q), p:8P(q) = =F. (oo:8) 


We have thus finally justified the assumption from which we 
started in Chapter IJ. 

The extension of these results to systems with several degrees 
of freedom causes no trouble. The kinematics of a system which 
1s expressed by a continuous Abelian group of rotations 1s conse- 
quently determined uniquely by the number f of degrees of freedom. 
The postulate of irreducibility allows us to conclude that the 
particular operators (15.3) of the Schrodinger theory are a 
necessary consequence of Heisenberg’s commutation rules.?? 

P. Fordan and Ek. Wigner ® have given a very elegant group- 
theoretic proof that there exists but one irreducible matrix 
solution of equations (14.10), 1.e. that one of degree 2/ there 
mentioned and given in greater detail at the end of §11. 


CHAPTER V 


THE SYMMETRIC PERMUTATION GROUP AND THE 
ALGEBRA OF SYMMETRIC TRANSFORMATIONS 


A. GENERAL THEORY 


§ 1. The Group Induced in Tensor Space and the 
Algebra of Symmetric Transformations 


HE principal problem we propose to solve in this chapter 
/ 1s the group-theoretic classification of line spectra of an atom 
consisting of an arbitrary number, say f, of electrons, 
taking into account the reduction of the space R! to {R, as re- 
quired by the Pault exclusion principle, and the spinning electron. 
For this it 1s necessary to consider in detail the representations 
of the symmetric group, 1.e. the group 7, of all f! permutations of 
f things. These are most intimately related to the representa- 
tions of the group u of all unitary transformations or the group 
c of all homogeneous linear transformations of a space ‘R,. 
This connection has already been touched upon in Chapter ITI, 
§ 5: the substratum of a representation of ¢ or u consists of the 
linear manifold of all tensors of order f in SR, which satisfy 
certain symmetry conditions, and the symmetry properties of 
a tensor are expressed by linear relations between it and the 
tensors obtained from it by the f! permutations. 

A tensor F of order fin the n-dimensional vector space ® = VK, 
is defined by its mf components or, as we prefer to say, ‘‘ co- 
efficients ’’ F(t,t, °° 2,); each of the indices 7 runs from 1 to 1. 
Tensors can be added and multiplied by arbitrary numbers ; 
hence the totality of such tensors F constitute a linear ‘‘ vector 
space’? Hf of nf dimensions. Further, F can be subjected to 
an arbitrary permutation s of its f indices, which can be thought 
of as a permutation of the f numbers 1, 2, °°: -, f attached to 
the indices 7 in the general component above; if s is the per- 
mutation 


1+, 2—> 2, _ oP haa os 
281 
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then the tensor sf obtained by applying s to F is, by definition, 
that tensor whose coefficients are 


SFliyig + + + is) = Fliyiy + + + Zp). (1.1) 


It follows from this definition that for any two permutations 
sand lt 


tisk) = (ts)F. 
A linear correspondence F —> F’: 
ae. eee) = daly Ro eye ee de Ee Re ee pony LD) 
is said to be symmetric if the coefficient 
a(iy +++ ty; Ry ++ hy) 


is unaltered on subjecting the sub-indices 1, 2,+--, f of both the 
indices 2 and k to the same arbitrary permutation s. The pro- 
cesses of addition, multiplication by a number and permutation, 
in the sense defined above, applied to tensors are invariant 
under symmetric linear transformations; and conversely, any 
transformation of tensor space under which these processes 
are invariant is linear and symmetric. The totality of symmetric 
correspondences constitutes an algebra 2’: if A and B are ele- 
ments of 2 then A + B, AB and cA (c an arbitrary number) 
are also.. The problem with which we shall concern ourselves 
is the reduction of SW into linear sub-spaces $8 which are in- 
variant with respect to 2, 1.e. with respect to all symmetric linear 
transformations, Wherever in the following we employ the 
terms invariant, irreducible, etc., in referring to the tensor space 
RS, they are to be interpreted with respect to the algebra Z. 

We give a brief résumé of our terminology. We are dealing 
with a vector space ® and a system 2 of linear correspondences 


Ere = AY 


of Rt on itself; we may often prefer to use the term “ linear 
projection’ instead of “linear correspondence (operator) ’? in 
order to bring out the fact that the correspondence need not 
be one-to-one. A (linear) sub-space S$ of R is invariant if an 
arbitrary projection A of the system 2 sends every vector 
rt of $$ over into a vector of $8; $B is trreducible if it contains 
no invariant sub-space other than itself and the space 0 con- 
sisting only of the vector 0. We shall always understand by 
a compiete reduction J == 8%, + SB, of the invariant sub-space 
% a complete reduction into two linearly independent znvariant 
sub-spaces §8,, 38, even when this is not explicitly stated. A 
linear projection y->y’ of the invariant sub-space $$ on the 
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invariant sub-space ‘8’ is similar if two vectors x and y of & 
which are related by a correspondence A of thesystem: b = Ar, 
are always projected into two vectors x’ and y’ of $B’ which are 
related by the same A: ’ = Ar’. % and $’ are similar or 
equivalent: SB’ ~ % if a one-to-one linear and similar corre- 
spondence can be set up between §§ and Sf’. In particular, 
these concepts are to be applied to the case in which the vector 
space is the tensor space WR = Rf of nf dimensions and & is 
the totality of symmetric transformations. 

In quantum theory the state of a system consisting of f 
equivalent individuals (electrons) with a system-space ‘1s 
described by a tensor of order fin ®. The energy necessarily 
depends on each of the f individuals in exactly the same way ; 
hence the Hermitian operator which represents the energy 1s 
necessarily symmetric in our sense. The fundamental dynamical 
law therefore allows us to conclude that an invariant sub-space 
% of HR has the property that if the tensor describing the state 
of the system is at any time in ‘$B no influence whatever can drive 
it out. A complete reduction of $Y into invariant sub-spaces 
S$ implies a corresponding reduction of the operator representing 
the energy; hence the term spectrum 1s reduced into classes 
of terms belonging to the various S$, such that the members of 
one class can under no conditions combine with the members 
of another. Naturally this division into non-combining classes 
is to be carried as far as possible. But this problem is exactly 
the one proposed above——the only difference being that we are 
here only concerned with the totality 2 of symmetric Hermitian 
operators. However, this restriction 1s quite irrelevant, for 


any symmetric operator can be written in the form A = A, + 7A, 
where 


1 
A,=;(A+ 4), A= 5(A—A), 


are both Hermitian. 


On going over to a new co-ordinate system in the fundamental 
vector space by means of a non-singular transformation 


te Salih) x, (1.3) 

| 

the coefficients of a tensor /' are transformed in accordance with 

Pty st ay) = 4 a(t yey) (teks) ss Altyhy) + F(Ryke + + + Ry) 
(1.4) 


The transformation (1.3) in vector space induces the symmetric 
transformation (1,4) in tensor space. These induced trans- 
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formations, which we shall call ‘‘ special symmetric transforma- 
tions,’ constitute a group 2) which 1s isomorphic with the com- 
plete linear group ¢ = c,; this representation of ¢ was previously 
denoted by (c)f. The group 2, is contained in the algebra 2. 
Hence a sub-space $ of Rf which is invariant under the algebra 
2 is a fortiort invariant under the group 2». That the converse 
of this result is also valid is not so self-evident. Nevertheless 
for all questions involving only linearity 2) can be replaced by 
the more extended 2, for 2 is what we might call an enveloping 
algebra for the group 2; by this we mean that any symmetric 
transformations can be expressed as a linear combination of 
appropriately chosen special symmetric transformations.! To 
show this we prove the theorem : 

A homogeneous linear relation 


Zins soins hye ky) lig gy by ky) = 0 (15) 
1s satisfied identically by all symmetric transformations 
X(t tt tyes Ry tt Ryd 


if it 1s satisfied by all special symmetric transformations, 1.e. if 
the equation 
Zilin ++ any bys ++ Relaligks) ++ + xlipky) = 0 (1.6) 

is satisfied for all values of the n? variables x(1k) for which 
the determinant |x(ik)| + 0. 

Proof. Denoting the pair (zk) of indices by 7 and calling the 
n? = m values of j simply 1, 2, - - +, m, the left-hand side of 
(1.6) is a homogeneous polynomial of order f in the m variables 
x (1k) = X;: 


f(X1X- 7 28 vl == 2h de: ee fm) hak ac. ee Ce eS 


' 
where f, + fo = ee aaeae == fand b(fi, Se > a TART 
times that coefficient c(jyJ, + * + J7) whose indices.contain 7 = 1 
f, times, 7 = 2 f, times,etc. On denoting that variable x(j172 °° *7,) 
in which the indices 7 = 1, 2,+ ++, m occur fh, fo°°*, fi, times by 
vf, fo, °° *, fm) the left-hand side of equation (1.5) becomes 


Pa ho, ea fm) V(fa, ho, my in): 


The determinant of the x(zk) 1s a certain polynomial D(x,%q° + + %m) 
in the variables x;. Our assertion is thus reduced to the well- 
known theorem of algebra: let ¢(x), D(x) be two polynomials 
in the variables x, x, °° * Xm, the second of which does not vanish 
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algebraically, i.e. its coefficients do not all vanish. If ¢(x) is 
zero for all values of the variables for which the value of 
D(x) + 0, then ¢(x) vanishes algebraically. 

This theorem is proved for a single variable x as follows. 
If ¢(x) does not vanish algebraically it has a definite degree 
p 20; let q be the degree of D(x). There are then at most 
p + q values of the variable x for which ¢(x) or D(x) vanish ; 
for any one of the remaining infinitude of possible values of 
x neither ¢(x) nor D(x) can vanish, contrary to assumption. 
The theorem is readily extended to polynomials in any number 
of variables by mathematical induction. The principal point 
is that the analytical vanishing of a polynomial for all values of 
the independent variables implies that it vanishes algebraically. 

In quantum theory the vector space R is unitary : the transi- 
tion from one normal co-ordinate system to another such is 
accomplished by an arbitrary unary transformation (1.3). 
The transformations thus induced constitute a sub-group 2% 
of 2 which is isomorphic to the unitary group u,, i.e. the 
representation (u)f of the unitary group. I assert that a sub- 
space $8 of R/ which is invariant and irreducible with respect 
to S' remains irreducible not only under the group 2, but under 
the more restricted group 2 as well. To prove this we must 
show that the identity (1.5) holds even when we assume only 
that (1.6) is true for those values of the variables x(1k) with 
unitary matrix. 

One of the most natural proofs of the above theorem con- 
cerning the formal vanishing of a form @ of order f depends on 
the process of “ polarization’: we assign arbitrary infinitesimal 
increments dx, to the values of the variables x,; the identical 
vanishing of @ then allows us to conclude that the differential 


dg 
4 dX; ax; 


vanishes for arbitrary values of x; and dx; This procedure 
also leads us to the desired conclusion in the case under con- 
sideration. Denoting by ® the matrix obtained by transposing 


(i 
tr (@dX) = 0 


where X, X + dX are two arbitrary neighbouring unitary 
matrices. In order that this be the case we must have 


dX = 1X +bX 


rows and columns tin , we have 
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where 6X is an arbitrary Hermitian matrix: the ‘ rotation” 
X + dX is obtained by following up the rotation X with the 
infinitesimal rotation 1 + 2-8X. But the equation 


tr (DX - dX) = 0 


implies the vanishing of ®X. This is seen immediately from 
the fact that a linear form 


Lyx Vik 


in the variables y,, = 8x(1k) vanishes identically if it vanishes 
for all values satisfying the condition y,; = 9,,; indeed, any 
matrix Y = |ly,,|| can be written in the form Y, + 1Y.where Y, 
and Y, are Hermitian. On multiplying the right-hand side 


of ®X =0 by X! we find ®=O0: all derivatives a 
vanish in the same sense as ¢ itself, 1.e. for arbitrary x(7k) whose 
matrix is unitary. But these derivatives are forms of order 
f—1; the truth of our assertion above is thus proved by 
mathematical induction. 

Every invariant sub-space $8 of QW is the representation 
space of representations of the groups ¢ and u which are con- 
tained in (c)f and (u)f respectively. Hence the above results 
prove that if SB is irreducible these representations are also. 


§ 2. Symmetry Classes of Tensors 


One of the most natural methods of obtaining invariant 
manifolds of tensors F consists in subjecting F to linear symmetry 
conditions of the form 


Yas) sk = 0, (2.1) 


8 


This suggests introducing the symmetry operator 
a= Dia(s):s. (2.2) 


Such operators can be added and multiplied with arbitrary 
numbers, and two operators a, Bb can be applied successively 
with the same result as the symmetry operator c = ba defined by 


c(s) = & Bt) a(t’). (2.3) 


In other words, we are here led in a most natural way to the 
algebra p of the symmetric group a = 7, of all permutations s. 
The clements of this algebra, which constitute an f!-dimensional 
linear space t, appear as operators which can be applied to 
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tensors of order f. We may call the numbers a(s) appearing 
in (2.2) the components of the element a. In particular, a is 
an Hermitian operator in the tensor space Rf if it is a real 
element, i.e. if it coincides with its Hermitian conjugate 4 
defined by the equation 


a(s) = a(s}). (2.4) 


Hence these real symmetry operators represent physical quan- 
tities of the physical system consisting of f equivalent individuals, 
whose total system space is R/; quantities of this kind are 
unknown in classical physics and cannot be pictured in terms 
of the usual spatial and temporal models.? 

* (2.1) or 


is a linear condition which is imposed on the element x = F 
defined by x(s) = sk. A symmetry class is defined by one 
or more equations of this kind; we are thus led to the definition : 

Each linear sub-space p of t determines a symmetry class % 
of tensors. F belongs to 3% when the corresponding symmetry 
quantity or element F is in py. It will be found convenient to 
denote the process by which §8 1s generated from p by a symbol ; 
we write % = Zp. 

If the reader finds it difficult to operate with elements F 
whose components sf are tensors rather than numbers he may 
replace the tensor by the totality of its coefhcients F(t, 12° + + 1,) 
and F by the elements 


X = Fliyt, + + + 1y) 


associated with each definite set of indices (2472+ ° + 7,); this x 
is defined by the equation 


x(s) = SF (iin - + i,). 


The requirement that F belong to p means that Fy(2,2. ° > + 1,) 
belongs to p for all the nf possible combinations of the indices 1. 
That the symmetry class $8 = tp is invariant with respect to 
all symmetric transformations (1.2) 1s due to the fact that (1.2) 
implies the corresponding equation for the elements F, F’. 
F'(1,1.' ++ 1,) is a linear combination of the elements F(k,k,° +: R,) 
associated with the various combinations (k,k,- ++ k,) of indices k. 

If F belongs to p then a: F does also, where a is any element 
whatever of the algebra. To show this we note that the 
s-component of 


H(i, ++ + ty) = a+ F(t, + + = ty) 
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is given by 
Dar) + rsk(t, + + + ts) = Ya(r)-sk(k, > + + Ry) 


where the &,, +++, ky are obtained from 2,, ++ +, 27 by the per- 
mutation r. Hence Af (7,, +++, zy) is a linear combination of those 
F(k, - + + k;) whose indices k are obtained by a permutation of 
the indices 2. 

The principal question now 1s whether every invariant 
sub-space §$ can be generated from a p by the process %, and 
further, whether or to what extent this generating ) is uniquely 
determined by 8. The answer is perhaps best expressed with 
the aid of the inverse process | which generates a p from the 
given $8. The following geometrical analogy may be useful 
in enabling the reader to understand the situation with which 
we are dealing. Let the points x of a plane with a fixed centre 
correspond to the elements of the algebra p and the line segments 
Ff going out from the origin correspond to the tensors. On 
contracting the entire plane, leaving the centre invariant, in 
the fixed ratior (0 S71) the point x goes into the point 
tx and the segment F into the segment rf; this contraction 
of segments shall be the analogue of the symmetrical trans- 
formations of tensors. ‘8 will now denote an ‘“ invariant ”’ 
set of segments, i.e. a set such that if it contains the segment F 
it also contains all the contracted segments rF. Just as we 
associated the symmetry elements F(z, - ++ 7,) with the tensor F 
we now associate with the segment F the continuum of points 
F(r) of F; F(r) is the end point of the segment 7F. Let p be 
any set of points; the segment F will then be included in the set 
= tp if and only if all its points F(r) are in p. Obviously the 
only segment sets $8 which can be obtained in this way are 
those which are invariant, and all such invariant sets can be 
so obtained. Only the “ core’’ py of the point set p is essential 
to this construction; pg consists only of those points x such 
that 7x belongs to p for all 7 (in the interval OS 7S). fy 
is invariant in the sense that with x all 7x belong to py. That 
only the core py is essential means that our construction generates 
the same segment set ‘8 from two point sets p, p’ if these latter 
have the same core; hence we can restrict ourselves ab initio 
to the consideration of invariant point sets p = py. It is extra- 
ordinarily easy to find the point set p which generates a given 
segment set $$: we include in p those and only those points 
lying on the segments of §8, and this p is automatically invariant. 

If the reader will think through this geometrical illustration, 
which we have formulated here in such a pedantic manner, he 
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will have no trouble in understanding the analogous situation 
for tensors and symmetry elements. A linear sub-space p of t 
is to be called invariant if all elements ax are in p, where x is 
an arbitrary element of » and a is any element whatever.* 
Hence such a p ts invariant under the totality of correspondences 
of the form 


(a): x-—> x’ = ax (2.5) 


On associating this correspondence (a) of rt on itself with the 
element a we obviously obtain a representation of the algebra p 
(and therefore of the group a,); it is called the regular 
representation. (t appears here twice: once as the repre- 
sentation space and again as the algebra p represented in this 
space; the first will be expressed by the German letter tr, the 
second by the Greck p. We are here doing the same thing as 
in III, § 2, where we obtained a realization of the group g by 
associating with the element a of g the correspondence s > s’ = as 
of the group manifold on itself.) This regular representation 
supplies us with material from which we can construct all— 
and herce in particular the inequivalent irreducible—repre- 
sentations of the algebra p. When we use the terms invariant, 
irreducible, ctc., in t they will always refer to the algebra of all 
correspondences (a) of r on itself, which is simply isomorphic 
with the algebra p of all symmetry elements a. p being an 
invariant sub-space of t, we shall always refer to the representa- 
tion induced in » by the regular representation simply as the 
regular representation 1n p; it associates with each element a 
the correspondence (2.5) of p on itself. The equation x’ = ax 
is, in terms of components, 


a a alr) x(rs). 


Let x be an arbitrary clement of »; the requirement that p be 
invariant allows us to conclude that the element x’ defined by 
x'(s) == x(rs) is also in p, where x is any fixed permutation. 

Let p be an arbitrary sub-space of t; we say that x belongs 
to the core Py. of p if and only if all quantities of the form ax 
belong to p; this p, is invariant. We thus have the theorem 
that two linear sub-spaces p, p’ generate the same symmetry 
class 3 = tp — th’ of tensors if they have the same core. We 
may therefore restrict ourselves ab initio to the consideration of 
invariant sub-spaces p. 


* This ‘‘ invariant sub-space’’ is not the same as an “ invariant sub- 
algebra ’’ as defined in Chap. III, § 13; to conform with our previous nomen- 
clature it should be called a “‘ left-invariant sub-aigebra.”’ 
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It is possible that certain relations (2.1) will be satisfied by 
all tensors. Let t, denote the smallest sub-space of 7 which 
contains the elements F(127, + + * 7;) associated with all tensors 
F and all values of the indices (2,7, - + + 2s). Then p generates 
the same 8 = tb as the intersection of p with t,; it is therefore 
natural to restrict ourselves further to the consideration of 
invariant sub-spaces ) of t. These remarks are not applicable 
if the dimensionality ” = f, for certainly the /! coefficients 


SF(1,2,-° fl) = FV, 2° 4 f) 


of the arbitrary tensor F are independent. But the situation 
is different in case 1<f: for example, let 6, = + 1 according 
as $ is an even or an odd permutation; then 


»o;° SF 


is an anti-symmetric tensor and must therefore vanish in case 
the dimensionality ” is less than the order f. 

We can at most hope that conversely p is uniquely determined 
by ‘8 if we restrict ourselves to tnvariant sub-spaces » which are 
contained 1n ty. In order to prove that this 1s indeed the case 
we attempt to find the inverse process which leads from % 
to p, following the programme outlined by the geometrical 
analogy considered above. In case » =f this is readily done 
as follows: if F is any tensor in ‘S$ we let the element 
x = F(1, 2,- - -, f) in vt correspond to it; p consists of all the 
elements x so obtained. But in order to obtain a method which 
is also applicable to the case  < f we must alter the procedure. 
We understand by p = {8 the smallest linear manifold containing 
the totality of elements F(1,, 12, °° *, 17) associated with all possible 
tensors F of $$ and all possible combinations of indices (1,1. ° + * 14). 
If the tensors &, constitute a basis for §8, p consists of all elements 
of the form 

«= XKealty st dy) By(ty + + + 14) (2.6) 
That such a p is invariant has already been shown above, for 
if X = F(zyt, + + + 14) the element x’ defined by x’(s) = x(rs) is 
equal to F(k,k, + + + ky) where kik, + + + ky are obtained from 
1419 °° * 1, by the fixed permutation r. 

We now denote the rp introduced above by bo ; it coin- 
cides with the entire space t when n 2 f/f. Let the symbol 3 
denote ‘‘1s contained in’’; the following results then follow 
immediately from the definitions: If p is a linear sub-space of 
rand % = tp, then Le 3p. If $ is any linear sub-space of 
RH and p= |, then conversely PB 3 ap. We can at most 
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expect that the symbol 3 can be replaced by = if in the first 
theorem ) is an invariant sub-space of ty and in the second if 
$8 is an invariant sub-space of R/. That these converse theorems 
are in fact true under these limitations will be proved in § 4. 


§ 3. Invariant Sub-spaces in Group Space 


We are in need of a fundamental theorem concerning the 
algebra of a group as a preparation for carrying through the 
investigation proposed above; we here prove this theorem for 
a general finite group. However, we do not alter the notation, 
so here wm denotes any finite group of order h. 

Theorem (3.1). If 1s an tnvartant sub-space of t there exists 
an element e of the group algebra having the following two prop- 
erties: (1) every element of the form xe belongs to ), (2) every 
element x of ) satisfies the equation xe = x. 

In particular (1) implies that e = le itself belongs to p, 
and hence by (2) ee=e; e 1s idempotent.’ Itis a ‘‘generat= 
ing unit” of p in the sense that p consists of all elements of the 
form xe. 

Proof. Let ey, @s, * + *, €, be a co-ordinate system in the 
vector space t which 1s adapted to the g-dimensional sub-space 
p in such a way that p is the linear set defined by e,, @9, °° +, e,. 
The parallel projection which transforms 


) 


X= %,€, +: ° ++ x,e, Into xX'= xe, +--°--++4%,e, 


has the two properties (1) it projects every x into an x’ lying in 
p, and (2) within p it is the identity. In the original co-ordinate 
system defined by the simple elements s of the algebra this 
projection is given by 


x'(s) = Sls, )x(0), 
t 
where the matrix d(s, t) is necessarily of the form 
d(s, t) = e(s)A(t) ++ + + + en(s)éo(t) 
and the é;(s) are defined by 
Dé,(s)ex(5) are Gin (2, k= I, 2, my gi 
The fact that p is invariant implies that if x is in p then the 
clement x, defined by x,(s) == x(rs) is also in p. Consequently 


the projection with the matrix d(rs, rt) has the same two prop- 
erties (1) and (2), where 7 is any fixed permutation (i.e. element 
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of the group 7) whatever. Hence the assertions also hold for 
the correspondence with the matrix 
1 

e(s, t) = j »4a(rs, rt) (3.2) 
obtained by summing over all elements 7 of the group. This 
matrix satisfies the equation 

e(rs, rt) = e(s, bt), 

whence e(s, t) depends only on the combination t74s: e(s, t) = 
e(t4s). The linear projection 


x’(s) = Dre(s, t) x(t) 


may therefore be written briefly x’ = xe, which proves the 
validity of the theorem. 

Let the invariant sub-space p be completely reduced into two 
invariant sub-spaces: p = p,-+ po, and let e be the generating 
unit of p. Any element in » can be written as the sum of 
its components in p, and p,; hence in particular e = e, + é,. 
From this it follows that for an arbitrary element x of p 

X = Xe = Xe, + Xe. 


But since X, = Xe, 1s in p, and xX, = Xe, is in P., X, and xX, 
are the (unique) components of x in ), and p,. These two 
components for the element e, are obviously e, and 0, whence 


€\e, =e), ee, = 0; 
similarly 
CoC, — 0, CoO o = C.. 


Hence é€,, €, are the generating idempotent units of p,, p, re- 
spectively; they are ‘‘ independent” in the sense of the 
equations 


€10C, —— 0, Ce, =— Q. 
On completely reducing p into any number of components : 
p= D’p,, the generating unit e of p is decomposed into 
i 
e¢ = IX 
7 
the components of which satisfy the analogous equations 
eC; ¢, = 0 (1 =+ R), €, OC; =: @;j. 


The existence of the generating unit offers a means of ob- 
taining a new and simpler proof of the fact that reducibility 
implies complete reducibility : 


INVARIANT SUB-SPACES IN GROUP SPACE 293 


Theorem (3.3). If), p, are invariant and p, 3 p, then ) can 
be reduced into ), +- p. in such a way that ~, 1s also invariant. 

Proof. Let e,; be the generating unit of p,. We decompose 
every element of ~ in accordance with the equation 


X= xe, + (X —- Xe). (3.4) 
The first component x, = Xe, lies in p,, and the second 
Xo —=— xX — XC, 


runs through a certain linear sub-space ~, of p when x runs 
through all elements of p. This sub-space p, is also invariant, 
for 


AX, = ax — (ax)e, 


as ax isin pif x is. The elements xX,, X, of P,, Pp, respectively 
satisfy the equations 


Xe; = X41, Xo€) — 0. 


From this it follows that the sum of an element y, of p, and an 
element x, of p, cannot vanish unless both y, and x, also vanish ; 
hence p, and p, are independent. ‘To prove this we merely note 
that on multiplying y, + x, = 0 by e, we find y,e, = y, = 0. 
Equation (3.4) represents the reduction of any element of p 
into its components in p, and pg. 


Any idempotent element e generates an invariant sub-space 
p, consisting of elements of the form xe. If e,, e, are two 
independent idempotent elements (e,e, = 0, e,e, = 0) then the 
sub-spaccs p,, py which they generate are independent, and the 
idempotent element e = e,-+ e, generates p=p, +p, An 
idempotent clement e is said to be primitive if it can only be 
expressed as the sum of two idempotent elements e, + e, if 
one of the summands 1s 0 (and the other e). J» order that p, 
be wrreducible it is necessary and sufficient that e be primitive. 

Obviously any idempotent clement e, in particular the 
modulus 1 of the algebra, can be reduced into the sum of 
independent primitive idempotent elements. For if we have 
a reduction into independent non-vanishing 1dempotent elements 


¢ = C1 +- C> —{- on ny + Cm 
and if, for example, e, is not primitive, it can be further re- 
duced to the sum of two independent non-vanishing idempotent 


elements e,’ +- e,"’; in this way we obtain a complete reduction 
of e into m + 1 independent terms, for we have, for example, 


€,€, = €)€,€, = 0; similarly e,e,; = 0. 
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This process must certainly cease after at most h steps. Our 
analysis allows us to assert that we thus obtain a complete 
reduction of p, into independent irreducible sub-spaces. 

We have seen that the theorem concerning the complete 
reducibility is a consequence of the existence of a generating 
unit. But the converse is also true: If p appears as a summand 
in a complete reduction r = p + ph’ of our given algebra t, then 
it possesses a generating unit. We need only to specialize the 
considerations developed above by applying them to the modulus 
1 of r; 1 can be completely reduced into the two components 
e-+ e’ lying in » and p’, and the generating units of p and p’ 
are e and e’ respectively. 

The mathematician will find it worthy of note that all these 
considerations are still applicable when the algebra is defined 
over any ficld whatever. Instead of dealing with the continuum 
of real or complex numbers, as in analysis, we may in abstract 
algebra operate in an arbitrary field, i.e. a domain of elements, 
called numbers, in which the two fundamental operations of 
addition and multiplication and their inverses, subtraction and 
division, are defined in accordance with the formal laws of 
ordinary arithmetic. Our development depended only on these 
rules of operation—with a slight restriction. There are fields in 
which a definite integer, say h, times any number of the field 
yields zero; we may say that h annihilates. Such ‘‘ modular ”’ 
fields must be excluded, for we wish to retain the possibility 
of finding a number such that its product with i is any given 
number. When our reasoning involves no more restrictive 
assumptions concerning the number field, we are operating in 
a relatively elementary theoretical domain. I{fowever, such 
theorems as the ‘‘ fundamental theorem’’ III, (10.5), and that 
of Burnside-Frobenius-Schur, which depend on the fundamental 
theorem of algebra, belong to a deeper layer. These theorems 
hold only in ‘‘ algebraically closed’ number fields, in which 
any algebraic equation (with coefficients in the field) is soluble. 
Finally such concepts as ‘‘ Hermitian,” ‘‘ unitary,” etc., involve 
the transition from a number to its conjugate complex and 
have no place in general abstract fields. Our earlier proof of 
the theorem of complete reducibility was obtained with the 
aid of such tools foreign to the general concept of a field. 

Theorem (3.5). A similarity projection x — x’ of the invariant 
sub-space ) on the invariant sub-space p’ 1s necessarily expressed 
by an equation of the form x’ = xb. (In particular, when p 
and p’ are equivalent this theorem is applicable to the one-to-one 
similarity correspondence p 5 p’.) 

Proof. Let the given similarity correspondence send the 
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generating unit e of p over into b. In virtue of the similarity 
xe then goes over into x’ = xb, where x is any element in p; 
but for such an element xe = x. 

Additional remark. The projection sends e into eb; hence 
eb = b. On the other hand, if e’ is the generating element of 
p’, then since B is in p’ we have be’ = b: 


b = eb = be’ = ebe’. 


We express this result, i.e. that b is of the form exe’, by saying 
b has the character (e, e’). Our considerations show that such 


a projection can always be expressed in terms of a unique 
element 6 of character (e, e’). 


If we are operating in the field of complex numbers, with which the 
investigations of analysis (e.g. the theory of functions) deal and in 
which we are exclusively interested in quantum theory, we may supple- 
ment the theorem (3.1) concerning the existence of a generating unit e 
in an invariant sub-space yp by the following : 

The generating unit may be so chosen that tt 1s veal ; tt 1s then deter- 
mined uniquely by y». 


To prove this we choose as the basis é,, é,,..., e€, of » a unitary- 
orthogonal system of vectors ; then 
De ,(S) ez (s) = 84 Ct ae) ee ee 2 
8 


In constructing d(s, #), which we now denote by e(s, ¢), we may therefore 
choose %,; = é;: 


g " 
e(s, t) = Se,(s) e,(t). (3.6) 
t=1 
I assert that the equation 


e(rs, rt) = e(s, ?) (3.7) 


is automatically satisfied—it is no longer necessary to take its mean 
value as in (3.2). The element e defined by e(t-'s) = e(s, ¢) is then the 
real generating unit of yp. 
In order to establish the validity of (3.7) it is only necessary to 
note that e(s, t) is independent of the particular unitary basis e,, é, 
., €, chosen ; for on going over to a new unitary basis e,, Oop ais 


€, by a unitary transformation U the bilinear form (3.6) remains in- 
variant. Now in particular the equation 


el(s) = e,(rs), 


in which »¢ is a fixed element of the group, defines a transition to a new 
unitary basis. 

To prove that this real generating unit e of » is unique, assume there 
exists a second, e’; then all elements x of yp satisfy the equations 


a 


xe= xX, xe’= x. 


On applying the first equation for x = e’ and the second for x = e we 
have : 
e’e=e', ee =e. 
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But since e and e’ are both real, the first of these results yields, on 
going over to the Hermitian conjugates, 

ee’ = e’, 
and from this and the previous result we conclude that e’ = e. 

Under these conditions the content of theorem (3.3) can be extended 
and its proof simplified. If e, e, are the real generating units of yp, p, 
respectively, then since e, is in » @,e = e,, and on going over to the 
Hermitian conjugates we find ee, = e,. Hence the idempotent element 
e, introduced by e = e, + e, 1s real and independent of e,; = yp, +}. 
is thus completely reduced into », and an invariant sub-space », which 
is unitary-orthogonal to », and which has as its real generating unit ég. 


§ 4. Invariant Sub-spaces in Tensor Space 


We now return to the investigation of tensors of order f, 
the totality of which constitutes the space RY. Let a again be 
the group of all permutations of f things and t ( = p) the corre- 
sponding group space (algebra). Let a be asymmetry quantity, 
i.e. an element of the algebra p, with components a(s); the 
element @ is then defined by 


a(s) = a(s7}) (4.1) 
The relation 
Pe ab, 


which asserts that the tensor F’ is obtained from F by the 
operator a, is equivalent to the equation 


F'=F-a 


between the corresponding clements F and F’ of the algebra p. 
For 
SF’ = Ya(t)-stF 
t 


is in fact obtained from 
ss Ma(t)+ tf = Dali"): tPF 
t 


é 
by operating on it with the permutation s. 

In the following considerations, which are concerned with 
symmetry classes of tensors, p (with or without index) always 
denotes an invariant sub-space of rt, @ the generating unit of 
p and §8 the corresponding tp. We may then say that e is 
the generating idempotent operator of the symmetry class $$ in 
the following sense : 

(1) eF lies in §§, F being any tensor whatever ; 

(2) if F is in $B it is reproduced by the operator e: eF = F, 

In this way we obtain avconstructive definition of the symmetry 
class $8 as the totality of all tensors of the form eF. This definition 
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is considerably simpler than the original one in terms of p, for 
it depends on a single element e instead of a mantfold p. If, 
for example, we are dealing with the class ‘8 of all completely 
symmetric tensors 


1 
Al as. 


is such an operator; the corresponding operator for the class 
of all anti-symmetric tensors is the alternating sum 


jie I sg 5. 


Theorem (4.2). If p’ 2p or p=, + Pe, we have P’ 32 §, 
T= , + YP, respectively. 

We need to prove only the latter part of this theorem, 
i.e. for the case of complete reduction. The generating unit 
é = @, + é@, of p has as components 6), 6, in ),, p, the generating 
units of p,, P. respectively. The formula 


ef =e, +e,F 


defines the corresponding complete reduction of ‘8 into the 
independent invariant sub-spaces %,, $,. \ 
Theorem (4.3). If), ~ , then $B, ~ Po. 
The similarity correspondence x,-> X, of p, on , 1s, by 
theorem (3.5), of the form 
= x, b, x= x, b’. 
Hence 


f= Dl’ ,. hoz bs, 


define a one-to-one similar correspondence of $8, on SB, and its 
inverse. 

Theorem (4.4). If) 2 tq then p = 48. 

The only non-trivial part of this first converse theorem which 
remains to be proved ts that p 3 EB. All tensors of the form 
fh, = ek, are in ‘$$, where (£,) is a basis for the entire tensor 
space JV; hence all elements of the form 


= Ca (ty st dy) y(t) + + + ty) 
are in $%. On introducing 
X= Litalts sts ty) By(t + + + ty) 


we have y= xé. On recalling the definition of ty = §9/ we 
sec that xé belongs to $B if x lies in ty. But in virtue of the 
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assumption that » 3 fr this is automatically satisfied if x is an 
arbitrary element of p; but then xé = x. Hence every element 
x of p is contained in 49. 

In order to formulate the converse of these theorems let 
%§ (with or without index) now denote an arbitrary invariant 
sub-space of Sf and p the corresponding B®. 

Theorem (4.5). If $B’ 3 8 or P= $%,+ Py, then p’ 3 p, 
pb = p, + Yo, respectively. 

Theorem (4.6). If B ~ PY thenp ~ py’. 

Theorem (4.7). = Sp. 


The last theorem is by far the most important of all; it 
asserts that every 8 1s a symmetry class of tensors. It is desirable 
to prove it first, i.e. to prove that fp 2 9. Let @ again denote 
the generating unit of p; Sp then consists of all tensors of the 
form F’ = eF. Since the element @ belongs to p it is necessarily 
of the form 


&(s) = es) = Deal s+ Ry) SEA(Ry + + + Ry), (4.8) 


where the tensors E, constitute a basis for the space $8. Now 
the trivial equation 


28c(ty tt dy) MSF (ty + + + 1) = 2h scot dy) Ftp ss + ty) 
shows, on replacing Sc by c, that 
Dc(ty st dy) SEC + 6 + 1p) = Zs ret sss ty) Flay s+ + ty). 
Hence we may replace (4.8) by 


e(s) = D'sea(hy +++ ky) F(R, + + + Ry) 


and the coefficients of F’ are then given by 
Pass i) = Dealt st dys Ry sts hy) E(k, + + + Ry) 
where | 


Cally * tps Ry st + Ry) = RSI (ay + + + 24) + Sealy + + + Ry), 


Because of the summation over all elements s of the group 7 
this transformation with coefficients c, 1s symmetric; hence 
the assumption that the sub-space §$ 1s invariant allows us to 
conclude that FP” lies in $8 if the &, do. But this establishes 
our theorem. 

The theorem can also be proved directly, without calling 
on the theorems of § 3, in the following way. That £& is in 
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ip means that F(2\7, - ++ 1,) is in p and is consequently of the 
orm (2.6) : 


F(i, an 14) = adult aed Ly 5 2 ie Ry) ° E(k, ae Ry). 


The £, constitute a basis of 8. Writing ue Bi S- component 
of this equation and replacing the indices 2),+ ++, t¢ by 141° °°, ty 
we find the equation 


F(t, + + + 14) = LS Malt st dy) Ry t+ Ry) + Bulky + + + Ry) 


for the components of F. Since this holds for every permutation 
s~! we may sum over the clements of the group and obtain 


Flin + ++ i) = Zealin ++ tps Bye ++ by) Balley + +» by) 


) 


where the coefficients 


Cat) tt tp; Ryo s+ Ry) = t Ssba(is st dys Ry ss + Ry) 


L'% 
are symmetric. Hence since the /, belong to the invariant 
sub-space ‘$8 and F is obtained from them by a symmetric 
transformation, F also belongs to $f. 

The only part of theorem (4.5) which is not self-evident is 
the assertion that p,, p, are independent. By theorem (4.7) we 
have the relations 


tp* 2 3p, 3B, Bpt 3 Pe 


for the (invariant) intersection p* of p, and ),. But since 
%,, $. are independent it follows that Zp*, and therefore p*, 
is empty. 

Theorem (4.5) shows the §8 associated with an irreducible p 
is also irreducible. Hence it follows, in particular, that the 
manifold of symmetric and the manifold of anti-symmetric tensors 
are irreducible and invariant, not only with respect to the algebra 
of symmetric transformations, but also with respect to the 
transformations induced in tensor space by the affine or unitary 
groups of transformations in the vector space i. Applying 
this to the 2-dimensional vector space, we see that the repre- 
sentations ©, of ¢c = c, or u constructed in ITI, § 5, are irreducible. 

In order to prove (4.6) we must first examine the nature of 
to (for 2</f) in some detail. We call the component a(I) of 
an element a of the algebra the trace of a. Hence the trace 
of the product ab, which we call the scalar product tr(ab) 


of a and BD, is 
tr(ab) = D'a(s)b(s7}). 
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The trace of a is then tr(al) = tr(la) = tr(a). The scalar 
product is obviously symmetric in a and b, and the symmetric 
bilinear form tr(ab) is non-degencratc, i.c. a = 0 is the only 
element for which the equation tr(ax) = 0 1s satisfied identically 
in X, 

Auxiliary theorem (4.9). %) 1s a left- as well as right-invariant 
sub-algebra of i. tr(ab) 1s non-degenerate within Yo, 1.e. the only 
element a of Yq whose scalar product with every element x of Yo 
vanishes 1s a == 0. 

The first part of this theorem is almost self-evident. For 
if x = F(i,---1,), the element x’ defined by x’(s) = x(sr) is 
F'(i, - + + t,) where PF’ = rF. 

Let f be the generating unit of t), a an element of rt) and 
x an arbitrary element. Then since f) is right-invariant ax 
is also in Yy, whence 


ax = ax:i, tr(ax) = tr(a- xi). 


Now xi is in t); hence if the scalar product of a with every 
element xi of t) vanishes then tr(ax) = 0 without restriction on 
x. It therefore follows that a = 0, as asserted. 

Proof of theorem (4.6). Let E, be a basis for $8, and let the 
similarity correspondence of §§ on $f’ send E, into the basis E, 
for B’. Let c,(1, ++ + ty) be a given system of coefficients and 
write 

CSD 24) PE ee ey) (4.10) 


oO == Seat, + + * ty) * Balt, + + + ty). 


The desired similarity correspondence between ) and p’ is naturally 
to be defined by c-> c’. However, this is only possible provided 
two systems of coefficients c,(1; > + + 1,) which define the same 
c also define the same c’; or a system of coefficients which 
causes c to vanish must also cause c’ to vanish. 

We first remark that if a tensor F satisfhes the equation 


G= Je(s3)-sh = 
then also 
G = Se'(s4)-sk =: 0. 
By (4.10) 
e(s-8) == Es ca(ky + +» hy) Bally * +» hy) 
whence 


Glin ++ i) = LE ealty + + ips Ry + + bey) B alley + + by) 
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where 
Calty ot tps Ry tts Ry) = YP (ty + + + ty) + Sca(ky + + + Ry). 


These c, define a symmetric transtormation. Hence the given 
similarity transformation $$ -> SB’, which sends E, into E,’, sends 
G into G. This proves our assertion that the vanishing of 
G implies the vanishing of G’. 

If ¢ = 0 we then have 


Se'(s}) + sFliy ++ + i,) = tele’ Fly « * + i] =0 


for all tensors / and all combinations of indices 1, + + + t,, or 
tr(c’x) = 0 for all clements x of to. Hence by the auxiliary 
theorem (4.9) c’ = 0. 

The result of our investigations is that there exists a one-to-one 
correspondence between the invariant sub-spaces ) of Yq and the 
invariant sub-spaces $8 of RY. This correspondence ts as close 
as possible ; irreducibility, complete reduction, equivalence and 
inequivalence on the one hand imply the same on the other. In 
particular, we emphasize the further consequence : 

Theorem (4.11). Fuery invariant sub-space % of Ri, in 
particular RS itself, can be completely reduced into irreducible 
invariant sub-spaces. 

[ hope that our clementary methods have made this corre- 
spondence quite apparent. 

It is evident a priort that we can completely reduce the 
modulus 1 of the algebra p into asum e, + e, + ++- + e,, of in- 
dependent primitive idempotent elements. The formula 


=e Fk +e) 4+-+-+-+e,F 


then gives the complete reduction of RY into independent in- 
variant sub-spaces ‘B8,, %., °° +, Bm, cach of which is generated 
by one of the idempotent operators e. (‘$, consists of all tensors 
of the form e,F.) From this point of view we might consider 
as the only non-trivial result of our investigation the assertion 
that the §8 generated by a primitive e is irreducible (with respect 
to the algebra 2 of all symmetric transformations). Physically 
this means that the class of terms corresponding to such a % 
cannot be further divided into parts which cannot -under any 
conditions interact with each other. If in spite of this there 
does exist such a decomposition it is accidental—i.e. attributable 
to the special dynamical situation in the case in question. 
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§ 5. Fields and Algebras 


We here interrupt our development in order to present an 
axiomatic treatment of the two fundamental concepts field and 
algebra ; our investigation has revealed the importance of these 
concepts for quantum theory. The physicist who ts not par- 
ticularly interested in such a treatment may well omit these 
sections. 

A field is a domain of elements, called numbers, within 
which the two operations of addition and multiplication are 
defined and which associate with any two numbers «, f of the 
field certain unique numbers « + f, af respectively. Addition 
obeys the commutative and associative laws 


at+B=Bto, (1+ 8)+y=a+B+y) 


and has a unique inverse, subiraction. From this follows the 

existence of a unique number o (zero) with the property 

ato=o+a=e for all « Further, associated with each 

number ais anumber — a, its negative, such that « + (~— a) =o. 
We require that multiplication obey the associative law 


(aB)y == «(By) 


and the distributive laws 


(a + B)y = (ay) + (By), (8B + y) = (a8) + (ay) 


with respect to addition. From the distributive law follow 
the relations 


“ZO == 0% = O. 


Multiplication need not be commutative ; 1n case it is we speak 
of a commutative field. ‘further, division by any number 
other than o shall be possible and shall lead to a unique quotient, 
i.e. each of the equations 


af =f, ya=f 


have for given « +0 and given f one and only one solution 
€, 7 respectively. From this it follows that the product af of 
two numbers can only be o if one of the two factorsiso. Asa 
further consequence, there exists a number ¢, ‘‘ one” or “‘ unity,” 
with the property that 


OE = EX— 4 


for all a We explicitly assume that not all numbers equal o; 
then in particular ¢ +0. Every number « =+o possesses a 
unique reciprocal a7! with the property aa! =: a"!a = «. 
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We must introduce in addition to the numbers of our field 
the ordinary numerical symbols 1, 2, 3, - + +. Their inter- 
pretation as multipliers is given by the equations 


la=a, 2a=ata, 3a=— (2a) +4,-°--: 
in general 


(n + l)a = (na) + a. 
In particular we can construct the series 
le, 26, > ++, me, ° °° (5.1) 


of multiples of ¢. We then have two possibilities. (1) All the 
numbers of this set may differ frome; then they are all different, 
and we can conclude with the aid of the equation 


np = ne-B 
and the division axiom that for a given number « there exists 
a ; 
one and only one number B = . which satishes the equation 


nB =a; we can then introduce ordinary rational numbers as 
multipliers. (2) The second possibility is that one of the multiples 
in (5.1) is equal to ¢€ itself; let the least multiple of this kind be 
pe. Then the numbers of the series (5.1) repeat in cycles of 
length p. p must be a prime number, for if p were the product 
of two integers m, n smaller than p we would then have 


O == pe = ME° NE, 


but by assumption neither me nor me are o, for pe is the lowest 
multiple of this kind, and this is contrary to the division axiom. 
In this case we are dealing with a@ finite field of modulus p.* 

In order not to lose ourselves in too broad generalities we 
now take as our number domain a commutative field and define 
a linear associative algebra of finite order over this field. 
By number we mean the elements of the field, and denote its zero o 
and its unit ¢ by 0 and 1; by element we mean an element of the 
algebra. We denote the former by small Greek and the latter by 
small Latin letters. An algebra is characterized by three fundamen- 
tal operations: addition of two clements, a+ 0; multiplication of 
an element by a number, ya; multiplication of two elements, ab. 
The first and second of these operations obey the familiar axioms 
of vector calculus (I, § 1), which we set forth here again for the 
sake of completeness. 

Addition is commutative and associative and has a unique 
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inverse, subtraction. It then follows that there exists a null- 
element 0. Multiplication by a number obeys the laws 


la= a, a(Bc) = (aB)c, 
(a + B)c == (ac) + (Be), @(b + ¢) = (aB) + (ae). 


The order h is introduced by the dimensionality axiom: every 
h + 1 elements of the algebra are linearly dependent, the co- 
efficients in the equations expressing the dependence being 
numbers of the field, but there exist hk linearly independent 
elements. <A set of A such elements e,, @9, ° + *, é,, called ‘* basal 
units,’’ form a basis for the algebra in the sense that any element 
a can be expressed in one and only one way in the form 


a= X16) -- Kolo + pie Bona oe +- nen 
and can be replaced by the set (a, a, * + °, a) of % numerical 
components. 


Multiplication of elements among themselves obcys the 
distributive laws 


(a + b)c = (ac) -+ (bc), cla + b) = (ca) + (cb) 
for both factors and the associative laws 


ya:b=y(ab), bya = y(ba), 
(ab)¢ = a(bc) 


We neither assume that multiplication is commutative nor 
that it possesses a unique inverse, division. But we do assume 
that the algebra possesses a ‘“‘ one,’ the modulus (or principal 
unit), i.e. an element e with the property ae == ea = a for all 
elements a. We shall usually not hesitate to denote the zero 
and one of the elements of the algebra by 0 and 1. 

If we assume the possibility of division the algebra reduces 


to a (in general non-commutative) field or division algebra of 
finite order hk over the given field. 


§ 6. Representations of Algebras 


For the sake of the printer and in order to give the text a 
more peaceful appearance we no longer emphasize the elements 
of our algebra by expressing them in boldface type. This 
applies in particular to the elements of the algebra p of ‘‘ sym- 
metry quantities ''—which we may often denote by this latter 
expression in case of possible confusion with the clements of 
the underlying group. We still employ this means of distinguish- 
ing between the tensor F and the symmetry element F or when 
we wish to consider an element as an operator acting on a tensor. 
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We start with an algebra p of finite order h, the elements of 
which constitute an k-dimensional vector space t, and associate 
with the element a of p the correspondence 


(a): x—>x' = ax 


of t on itself. We consider the algebra (p) of transformations 
(a), which is simply isomorphic with the algebra p, as funda- 
mental for the vector space ft, 1.e. the term reducible, invariance, 
etc., as applied to sub-spaces of t are with respect to the 
group of transformations (a). We assume that t can be com- 
pletely reduced into irreducible sub-spaces p, + P2 +++ +; cach of 
these sub-spaces then contains an idempotent generating unit 
€1, €2,° °°. We have already seen that this assumption ts true 
for the algebra associated with any finite group—at least under 
the restriction that the field over which the algebra ts defined 
does not have as modulus a.prime number which 1s a factor of 
the order kh of the group. 

We discussed the representations of a group or of the corre- 
sponding algebra in Chapter III. We found that the irreducible 
representations are subject to certain important conditions 
which, surprisingly enough, limit their number and which, 
together with the as yet unproved ‘ completeness theorem,”’ 
lead to the reduction of the given algebra into independent 
simple matric algebras (III, § 13). That we were unable to 
prove the completeness theorem with the methods there em- 
ployed was to be expected, for we assumed that the representa- 
tions were given and examined their properties; we had no 
general process for the construction of representations of the 
given algebra. But we are now in possession of the materials 
for such a construction: the reduction of ¢ into irreducible 
sub-spaces p, reduces the regular representation into as many 
inequivalent irreducible representations of our algebra as there 
are inequivalent invariant sub-spaces p; We shall now carry 
out this construction process to the point of obtaining the re- 
duction of our algebra into independent simple matric algebras ; 
it will be desirable to derive the previous results again from this 
standpoint. A further difference between this investigation 
and that of Chapter III consists in the fact that we here refrain 
as long as possible from placing restrictive assumptions on the 
commutative field over which the algebra is defined; only at 
the end of the investigation do we discuss the advantages at- 
tributable to the fact that the continuum of complex numbers, 
the only field in which we are interested for the physical appli- 
cations, is algebraically closed. 
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Theorem (6.1). Every representation of the algebra p 1s com- 
pletely reducible into irreducible representations. Each of these 
irreducible constituents 1s equivalent to the representation induced 
in some ), by the regular representation. 

(Hence the complete reducibility of the given algebra implies 
the complete reducibility of its representations. Further, every 
irreducible representation is contained in the regular repre- 
sentation, which therefore constitutes an appropriate starting 
point for obtaining all representations by the method of reduction). 

Let § be an n-dimensional representation, and let e, €2, °° °, 
e, be m fundamental vectors constituting a co-ordinate system 
in the representation space R of . If the clement a of the 
algebra corresponds to the linear correspondence A in §, we 
interpret the equation 


. ar. as- 2 =r. 


where x’, r are vectors in ®. Ife is a given fixed vector and x 
runs through all elements of one of the irreducible invariant 
sub-spaces p =p, of r then, as we shall show immediately, 
xe runs through a certain sub-space p(e) of & which is invariant 
with respect to §9. Indeed, the transformation A associated 
with an arbitrary element a sends xe over into (ax)e, and if 
xisinp, avis also. p(e) is either 0 or is similar to p in the sense 
that different x gencrate different images xe, for those x of p 
for which xe = 0 constitute an invariant sub-space p’ of », and 
in virtue of the assumption that p was irreducible p’ must 
either be O or p itself. Ilence if p(e) + 0 the representation 
induced in p(e) by is equivalent to the regular representation 
in p 

These considerations are to be supplemented by the following 
remark. If ‘8 is any invariant sub-space of Wt then p(e) is either 
independent of $8 or is contained entirely in ‘§, for those elements 
x of p for which xe lies in ‘8 constitute an invariant sub-space 
of p, which is therefore necessarily either 0 or p itself. 

Now construct successively 


Piler), Pele), °°, 
Pils), Pele), ° °°, 


. e ae 


Pilea)» pa(en), ee 


Each sub-space in this list is either entirely contained in the 
sum of the previous ones or is independent of this sum; on 
retaining only those sub-spaces for which this latter possibility 
is realized we obtain a reduction of & into certain invariant 
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sub-spaces p,(e,). To prove this theorem we need only to note 
that the sum of the sub-spaces contained in the first row con- 
tains at least the vector e,, that on adding to them the sum of 
those contained in the second row we obtain at least the vector 
e, in addition, etc.° 

The theorem just proved is in particular applicable to the 
symmetric group 7, and we now wish to establish the analogue 
for the algebra XY of symmetric transformations in the space / 
of tensors of order f. We already know that #/ can be reduced 
into sub-spaces ‘8; which are irreducible with respect to & 
(provided the number field over which & is defined does not have 
as modulus a prime Sf). Every transformation A of 2 1s at 
the same time a transformation A, of ‘8; on itself and the corre- 
spondence A-> A, is naturally a representation of 2, the 
‘representation induced in $8; by the algebra 2.’’ We wish to 
show that the representations of 2 are completely reducible 
into irreducible constituents, and that each of these constituents 
is equivalent to the representation induced in some ‘$8, by the 
algebra 2. Naturally this does not follow immediately from 
theorem (6.1); in order to establish the connection between 
the two we must show that the complete reducibility of 9 into 
irreducible invariant sub-spaces ‘$, implies the same for the 
algebra 2. We apply the notation and conventions given at 
the beginning of this section to the algebra 2: (A) is the 
correspondence 


S—> S' = AS 


of the ‘‘ vector space”’ 2 on itself, 4 —> (A) the regular repre- 
sentation of 2; the algebra of transformations (4), which is 
simply isomorphic with 2, 1s taken as fundamental in the vector 
space 2, i.e. the transformation group of 2 consists of the 
transformations (A). 

Theorem (6.2). Let & be an algebra of transformations in a 
vector space Kt, and let KR be completely reducible with respect to 
this system X of transformations into irreducible invariant sub- 
spaces J;. Then & 1s itself completely reducible into irreducible 
invariant sub-spaces II;, and the representation induced by the 
regular representation in II; coincides with (more precisely, is 
equivalent to) the representation induced in one of the irreducible 
, by the algebra & itself. 

This theorem holds without any restrictions on the field 
over which 2 is defined. Let JZ be an irreducible invariant 
sub-space of 2’ (consisting not merely of the transformation 0), 
and let R+0 be a transformation of JJ. There then exists 
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a vector a in such that Ra +0. Let a be decomposed into 
its components a, in the various sub-spaces ‘$;; at least one of 
these components, say a; = e, must be carried over into a vector 
Re + 0 by R. We now hold e fixed and let S in 3 = Se run 
through all transformations of IZ; these 3 then constitute an 
invariant sub-space JJ(e) of $8 = $,. The ‘‘ typical reasoning ”’ 
already applied in the proof of the previous theorem then allows 
us to conclude that: 

(1) JZ(e) is either 0 or $$, as $f is irreducible; in this case 
it is necessarily 8, for the vector Re + 0 belongs to I(e). 

(2) S=0 is the only transformation in JZ which sends e 
over into 0, for those S of IZ for which Se = 0 constitute an 
invariant sub-space of the irreducible sub-space JT. Hence 
8 = Se sets up a one-to-one correspondence between J] and §. 

This correspondence is similar, for S’ = AS implies that 
the vectors 8 = Se, 3’ = S’e satisfy the equation 3’ = A8. We 
have thus proved the second part of our theorem: the repre- 
sentation induced in JJ by the regular representation coincides 
with the representation induced in $8 by the algebra itself; 
briefly, [7 is similar to some §,. 

Since Se runs through the entire sub-space $$ when S runs 
through J7 there exists an & in JJ such that He =e; then 
Be =e. Since the transformations E and EE? of JZ both 
associate the same image with e they are identical: & is idem- 
potent. Hence 2 can be completely reduced into two inde- 
pendent sub-spaces JJ + 2” in accordance with the formula 


S= SE + (S— SE). 


[Cf. the proof of Theorem (3.3).] Successive application of 
this procedure leads to the complete reduction of 2 into its 
constituents IT;. 

Having proved Theorem (6.2), we obtain from Theorem 
(6.1), under the same assumptions, the further theorem : 

Theorem (6.3). Every representation of & is completely 
reducible into irreducible representations. Every irreducible re- 
presentation of & coincides with the representation A—-> A, 
induced in some 3, by the algebra 2 itself. 

Theorem (6.1) yields the further (rather uninteresting) fact 
that not only is every JJ; similar to some §8,, but also conversely 
every 58, is similar to some JJ;. 

As has already been indicated, all of these results are applic- 
able to the algebra of symmetric transformations in tensor space 
R/. But we have shown in § 1 that this algebra can be replaced 
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by the group (c)/ induced in tensor space by the group ¢ of 
linear transformations 


Oe 
| a 


Ths 


a(ik) x, [det [a(ik)] = 0] (1.3) 


of n-dimensional vector space, i.e. by the representation (c)f of 
c. We shall say. that a representation of ¢ is of order f if the 
components of the matrix A, which corresponds to the element 
(1.3) of the group, are rational integral functions of the a(zk) 
of order f. Our theorem then asserts : 

Theorem (6.4). Every f™ order representation of ¢ 1s com- 
pletely reducible into irreducible representations, and every irreduc- 
ible representation of order f of ¢ is contained in the representation (c)f. 

This theorem 1s still valid on restricting the affine group ¢ to 
its unitary sub-group u. (Naturally the concept “ unitary’? 1m- 
plies that we are then no longer dealing with an arbitrary field, 
but are operating in the field of all complex numbers.) 


§ 7. Constructive Reduction of an Algebra into Simple 
Matric Algebras 


We again assume that the algebra p of order h, which may 
at the same time be considered as a vector space t of h dimensions, 
is completely reducible into irreducible invariant sub-spaces p,. 
The generating units e,; of these irreducible p; are obtained by 
the corresponding reduction of the modulus; we can then 
express an arbitrary element x of v as the sum of its components 
in the various p, : 


= Die; (ein p,), x= Dixe,. (7.1) 


If q is a sub-space of ¢ we denote by qa the totality of elements 
of the form xa where x runs through all elements of q; e, with 
or without index, 1s an idempotent element, usually primitive ; 
b = te the invariant sub-space generated by e; h the repre- 
sentation of p induced in p by the regular representation. 

We could consider in addition to the reduction (7.1) of t 
into left-invariant sub-spaces the analogous reduction into 
right-invariant sub-spaces by means of the equation 


XS Dex. 
i 


But the most complete separation into mutually independent 
components is obtained by carrying out both of these processes 
simultaneously : 


x= De Xey =— D Xx: (7.2) 
1,% i,k 
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The elements of the form e,xe, are those of character (e,, e;,), 
or briefly (2k). Let p,, be the sub-space consisting of all elements 
of this character. The various );, are independent and the 
entire rt is reduced into the sum of the py; the original left- 
invariant p,=2p,, The important properties of p,, are given 


t 
by the following : 

Auxiliary Theorem (7.3). I. If p, p’ are two inequivalent 
irreducible sub-spaces with generating units e, e’, all elements of 
character (e, e’) are = 0. 

Il. The elements of character (e, e) constitute a field or division 
algebra which is simply tsomorphic with the system of similar 
projections of ~ on itself. 

Proof. I. Let a be any element of character (e, e’). The 
transformation 


[aj: x > x = xa (7.4) 


carries every element x of ) over into an element x’ of p’ and 
defines a similar projection. Conversely, we know that any 
similar projection of p on ’ 1s defined by an equation of this 
form, and that the generating element a of character (e, e’) is 
uniquely determined by the projection. If p and p’ are irre- 
ducible our ‘“‘ typical reasoning ’’ Ieads us to the two usual 
alternatives: cither the projection associates with every element 
x of p the image x’ == 0 or it defines a one-to-one correspondence 
of pon p’. The equation ea = a tells us that the first alternative 
is possible only if a = 0, and the second implies that p and p’ 
are equivalent. 

II. The above remarks are applicable to an element a of 
character (e, e) and the similarity projection of ) on itself which 
it generates. If p is irreducible every such projection, except 
the one defined by a = 0, 1s one-to-one and consequently has 
an inverse. But the existence of an inverse is identical with 
the possibility of division. The isomorphism asserted in the 
theorem is apparent on reversing our usual procedure, and 
reading the resultant of two or more correspondences from 
left to right, for the resultant of the correspondences 


4 


Sere) Re leet 
is given by 
x’’ == x(aa’). 
We now proceed with the help of this auxiliary theorem as 
follows: Arrange the p; into classes of equivalent sub-spaces 
with generating units 


/ n" n" 


, 
Qy °° 8, Cy; €4) er es) 


REDUCTION OF AN ALGEBRA 311 
and add together the generating units in each of these classes : 
e+: ° -texe’, ey +: ‘ -+te=e” ce 
We then have 
l=e+e*’+-:: (7 
r=rvf+r+-:-:: (7. 
where rt’, t’’, - + + denote the inequivalent sub-spaces te’, te”’, «+ > 
into which t is reduced. 
Part I of the auxiliary theorem above then tells us that, 
for example, 
e’xe’’ = 0, 
Hence the product a’a” of two elements belonging to different 
sub-spaces vt’, t’’ is always 0, and the reduction 


a=ata’+++ + =a tas’ 4+::-: 
leads to the multiplication rule 
ab =a'b’+ab’4+--- 


From this it follows that r’ is both right- and left-invariant and 
a fortior: constitutes an algebra p’ (“‘ invariant sub-algebra ’’) ; 
e’ is the modulus of p’.. The given algebra is then the direct sum 
of the simple algebras p’, p’’, +++, where the precise meaning of 
direct sum is defined by the following : 

Let p’, p’, «+ + be algebras (defined over the same field), and 
consider as the elements of a new algebra p, the direct sum of 
p,p’,---, all sets 


consisting of an arbitrary element a’ of p’, an arbitrary a” 
of p’, +++. The fundamental operations in p are defined by 
(a’, a”, oe -) - (b’, b”, i. *) — (a’ +. b’ a’ + b’, ee Oe =) 
N(a’, a’, + + +) c= (Aa; da", + + +), 
(a’, a’, Sn \(B', bY & +) a (a’b’, ab” mw -) 


where A is any number. 

Note that the central of the algebra p obtained by direct 
summation is the direct sum of the centrals of the individual 
algebras p’, p’, + °°. 

We investigate in detail one of these simple sub-algebras, 
say p’, which we now denote simply by p; its modulus e’ 
may now be denoted by 1. On omitting the primes, the de- 
composition of 1 into equivalent primitive idempotent elements 
é, 1s expressed by 


L=e+é,-+°° "+ ey. 
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Every element a of p is reduced in accordance with the formula 
(double Peirce reduction) 


into components of characters (1k). The component c,, of the 
product c = ab is easily seen to be expresscd in terms of the 
components a,x, 5;, of a and b by the equation 


: 
Cik = Pts ae 


We have thus already obtained the connection between our con- 
siderations and the matrix calculus, 

The invariant sub-spaces p,, Po, °° *, ~, generated by the 
€1, €o, °° *, é, are all equivalent. Let p be any of these classes, 
e.g. p = p,, and let I’, be any fixed one-to-one similarity corre- 
spondence of p,; on p. In accordance with (7.4) any element 


a= 43> € Ae x 


of character (e,;, e,) generates a similarity projection [a] of p, 
on p,; this projection can be written in the form 


[a] = Lal? (7.7) 


where a@ is a similarity projection of p on itself. But by Part II 
of the auxiliary theorem proved above the similarity projections 
of p on itself constitute a field (division algebra) ® which is simply 
isomorphic with the set of elements of character (e, e). If ® 1s 
of order v each of the 7 left-invariant sub-spaces 


Dx = 3 in 


is of dimensionality g=r-:v. The number of times r an irre- 
ducible representation occurs in the regular representation 14s 
accordingly a factor of the dimensionality g of the representation. 

Any element a can be reduced into its components aj, 
which may be any eclements of the independent sub-spaces pj,. 
In accordance with (7.7) 


[ax] = Dyail ye (7.8) 


and a,;, may be replaced by the corresponding element a,, of 
the field ® Since conversely any such element a,, is by (7.8) 
associated with a similarity projection [a,;] of p; on p,, and there- 
fore with a definite element a,, of character (7k), we obtain 4 
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one-to-one reciprocal correspondence between the totality of 
all elements a of the simple algebra p and the totality of matrices 


Oyy yn * * * Ayer 
hoy %ee ° °° Ker (7.9) 


ory Are * * * er 


of order r whose components a«,;, are clements of the field ®. 
The correspondence is such that to the three fundamental 
operations of the one (addition of elements, multiplication of 
an element by a number and multiplication of two elements) 
correspond to the same operations of the other. Note that in 
particular 


[43 5x] v7 [2,5] [B54] = Pyayl Fis 7 iPad | k ; 
= D;- anBs.° Ty’. 


We have thus proved : 

Wedderburn’s Theorem.® Any of the simple algebras, whose 
direct sum constitutes the given algebra p, 1s simply 1somorphic 
with a simple metric algebra in a certain field (division algebra) 
® defined over the field of the original algebra. 

(Remark, The invariant sub-spacc p, consists of all clements 
a such that the matrix ||«,,|| has as its only non-vanishing column 
the k'*. The element e; is then described by that diagonal 
matrix all of whose components vanish except the one occupy- 
ing the 2" place, which is 1.) 

It is readily seen that the central of the simple algebra p 
consists of those elements whose matrix (7.9) is of the form 


oe 0+ = 20 
0 a::: 0 
GO: Qos 2 ay 


where a belongs to the central of the field @. 

Our construction was divided into two steps. First rt was 
completely reduced into the sub-spaces vt’, t’’, - + + which are 
both right- and left-invariant and then these were further 
reduced into the left-invariant sub-spaces p;. We must now 
return to the consideration of the first step. On multiplying 


xs’ on the left by (7.5) we find 
AG = € XE, 
and on multiplying e’x on the right by the same factor 


ex == 6 xe. 
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Hence 
bd ie te SHG 


the e', &'', +++ commute with all elements and belong to the central 
of the algebra. The sub-spaces t’ = p’, t’’, +++ are both right- 
and left-invariant in the sense that neither the transformation 
x’ = xa nor x’ == ax leads out of them, and they are furthermore 
irreducible in this respect-—indeed, it is for this reason we call 
them “‘simple.’’ In order to show this we proceed as follows : 

(7.10). If to is a sub-space which is both right- and left- 
invariant then cither e; is contained in ft, or tye; = 0. For 
T)é; 1S an invariant sub-space of the irreducible p,; and 1s there- 
fore either 0 or p; itself. In the second case we have 


DP, = Tres 3S Ur 


since Y) 1s right-invariant ; hence e,; is contained in f4 
(7.11). If e; is in ty the same is true of any e which is equi- 
valent to e;. For the similarity projection x’ = xb of p; on p 
associates e with some element a, of p, by means of the equation 
e = a,b, and since 4a, 1s in Y%, e 1s also. 
(7.12). If ty 2 v’ then since ty = Ytye, not all the tye; can 
1 


be empty, i.e. one of the e, must occur in ty. But they must 
then all occur in tf, hence also e’ = Sve,, and consequently tq = 


(7.13). Again let ry be a right- and left-invariant sub-space. 
Then either tye’ =’ or it is empty; in the former case é’ 1s 
in Y. It follows from 


To = oe +1)e°+-°-° 


that t, is necessarily the sum of certain of the spaces 1’, 1’, + °°; 
when in particular Y) 1s irreducible in the sense of right- and 
left-invariance it must coincide with one of the r’, v”, 
Hence the reduction (7.6) 1s unique. This further shows that 
every right- and left-invariant sub-space Yg possesses a generating 
unit 2 which belongs to the central of the algebra, and that t 
can be completcly reduced into ty and a supplementary right- 
and left-invariant sub-space. 

(7.14). If p is an irreducible (left-) invariant sub-space with 
the generating unit e, then pe’ is invariant, and since pe’ = e’p 
it is either 0 or p itself. Since 


p= pe’ + pe pos 


the equation pe = p must hold for some one of the e’, &” 
while for all others pe = 0. We then say that ¢ belongs to bp 
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and that conversely e or p belongs to €. is a sub-space of the 
right- and left-invariant ve. 

An algebra p = rt, concerning which we only assume that it 
is completely reducible into irreducible invariant sub-spaces p,, 
is necessarily obtainable by successive application of the follow- 
ing processes : 

(A) Construction of a field ; 

(B) Transition to matrices: we take as elements the matrices 
of a fixed order r whose components are arbitrary elements of 
the field ; 

(C) Direct summation. 

The processes (B) and (C) are formally completely determined 
and are therefore of an elementary character. Hence the 
construction of algebras is reduced to the construction of fields, 
i.e. of special algebras in which division is possible (‘‘ division 
algebras ’’). 

The converse is naturally also truc: any algebra constructed 
by the three steps (A), (B) and (C) 1s completely reducible, for: 

(A) If the algebra r is itself a field, r is itself an irreducible 
sub-space of r. For if a is any non-null element of the field 
then €a runs through the entire field with €; this is merely 
the content of the division axiom. 

(B) The matrices (7.9) in which all components of every 
column except the 2‘ vanish constitute the irreducible sub- 
space p,, and the space rt of all matrices is the sum of these p,. 
p, is irreducible ; to show this we must prove that if a is any 
element in p, then any element of p, can be expressed in the 
form xa. a as well as a’ = xa has as its only non-vanishing 
column the 7"; dropping the last index 1, we denote these two 
columns by 


(a, he, * * *, ar), (0, Oe, eee a,), 


respectively. The equation a’ = xa is then 


: 
a; = oF ik%k ) 
k=1 


we are therefore concerned with proving the theorem that any 
non-vanishing “‘ vector’’ (a,a, °° + #,) can be transformed into 
any given ‘‘ vector” (aa, °° + a) by an appropriate linear 
correspondence. Since not all the a, vanish take one of them, 
say %, which does not vanish and let all €, for which k + 2 
be 0; &,, 1s then to be determined by the equation 


a; = E ios | 


that this is possible is guaranteed by the division axiom. 
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(C) The assertion is self-evident for this step. 

In general only the first step, (A), does not lend itself to an 
exhaustive formal treatment. However, 1f the field over which 
the field (‘‘ division algebra’’) referred to in (A) is defined 1s 
algebraically closed this step becomes extremely simple: 

The only division algebra of finite order over an algebraically 
closed field is this field itself. 

Proof. Consider an algebra of order v defined over an 
algebraically closed field. If @ is an clement of the algebra 
there must exist a lincar dependence between the v + 1 powers 
a’, av}, +++, a, I, i.e. a linear relation whose coefficients are 
numbers of the field. Hence a satisfies an algebraic equation 
of degreem Sv: 


f(A) = )\m + y,An™} + ee 3g. 48 +. Vn 
f(a) =a™+ yam? 4++-+y,1= 0. 


Since the field is algebraically closed f(A) can be expressed as 
the product of linear factors : 


f(A) = (A — a )(A = a) 2s (A am). 
Correspondingly 
(a — al)\(a — al) +--+ (a—a,l) = 0. (7.15) 


We now introduce the assumption that the algebra of order v is 
a division algebra; then the product of two or more elements 
can vanish only if one of the factors is 0. Hence we may con- 
clude from (7.15) that a= a,1 for some 1; the algebra then 
consists of the products of the modulus 1 with any number of 
the fundamental field, and therefore the algebra itself is simply 
isomorphic with this field. 

If we are dealing in the ficld of all complex numbers the 
auxiliary theorem (7.3) can be replaced, in accordance with 
the above, by the more definite : 

(7.3'). All elements of the form ex'e are zero if the primitive 
idempotent elements e, e° are inequivalent. If they are equivalent 
all such elements are multiples of one of them (which is different 
from 0). 

Further: The number of times an irreducible representation 
appears in the regular representation is not merely a factor of the 
dimensionality of the representation ; it 1s actually equal to it. 
Our analysis has thus revealed the true source of this remarkable 
fact. 

Under these circumstances the given (‘‘ semi-simple’) algebra 
ts the direct sum of simple matric algebras over the original field. 
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We obtain a complete set of basal units ¢,, e7,° °°: 
a= Sopen + Loe, (7.16) 
tk x 


for the algebra; these basal units satisfy the multiplication 
law of ‘‘ matrix units,” i.e. products of the type 


Cink = Ck (7.17) 
and all others vanish. The correspondences 
a—> loll, 4 > [lel 


are the inequivalent irreducible representations ’, 9”, -- ° 
The basal units e;,, e”, ° : : are the generating units ¢;,¢,° °° 
of the irreducible sub-spaces p,; with which we began our con- 
struction. e, is the element of character (1k) generated by 
the correspondence I';I’y* of p; on px, 1.e. that element which 
this correspondence associates with e. 

After having obtained the irreducible representations in 
this constructive way we derive their orthogonality properties 
again from our present standpoint. For the moment let the 
trace of a denote the trace of the correspondence 


“> Y= ax (7.18) 


of r on itself which is associated with a in the regular repre- 
sentation. In terms of the co-ordinate system defined by the 
basal units above this correspondence becomes 


g’ 
a / , 
Vik = D4, § 56, aac 
j=l 
Each of the g’ columns of variables 


Sik are Be Bees eo ( ai I, 2, en g’) 


/ 


undergoes the transformation with matrix |la,||; the trace of 
a is accordingly 


g’ 

Bo Bias, a 6 * 
By (7.16) this is equivalent to the equations 
0 (1 + k) 
tr (é;,) = 4 ,7- jn oe 
ee te (1 = R) 
for the basal units. Hence by (7.17) 

tr (€4@n;) = 8, °° (7.19) 
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and all other types of products of basal matric units have a 
vanishing trace. 

If the algebra is the algebra of a group of order hk the corre- 
spondence (7.18) is expressed in the original co-ordinate system, 
consisting of the elements s associated with the elements s of 
the group, by the equation 


Vis) == Jalst?) x(t). 


From this it follows that the trace, as defined above, of a is 
equal to h-a(l); but in the case of a group algebra we have 
previously called a(l) itself, without the factor h, the trace of a. 
On returning to this original definition of the trace we need 
merely to replace the right-hand side g’ of the orthogonality 
relations (7.19) by g’/h. Equation (7.16) may now be solved 
explicitly for the coefficients : 


2 wir Ge > als) - &f,(s-?). (7.20) 


The connection with the development in Chapter IIT, § 13, is 
obtained by noting that the 


x,(s) = . - e,(s-) (7.21) 


are the components of the matrix U’(s) associated with the 
element s of the group in the irreducible representation }’. 
The character of h’ 1s therefore 


xX (s) = > -e(s73) | (7.22) 


and (7.19) yields the orthogonality relations for the representa- 
tions. 

We have thus arrived at a constructive formulation of the 
theory, in which the fundamental concepts involved in and the 
range of validity of each step are clearly apparent. It supplies 
us with a constructive method for obtaining a complete set of 
irreducible representations, as well as establishing the ortho- 
gonality relations. 


Additional remark. In dealing with the continuum of all complex 
numbers and a group algebra defined over this field we can, in accord- 
ance with the remark at the end of § 3, completely reduce the modulus 1 
into real primitive e; and the space r into the corresponding unitary- 
orthogonal irreducible y,, Further, the projections I, can be normalized 


in such a way that ae is conjugate to Cs To show this we note that 
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the conjugate of ey is under these conditions an element of character 
(kt) and must therefore be the product of en; by a number y»,,: 


Cin = Mae’ eke (7.23) 
The rules a 2 a 
Cee = ip Su = Mn ak 

yield the conditions 
YaeYeR = ¥iv Ya = 1 


on the coefficients. Further, y,, is real and positive, for from (7.23) 
and (7.19) we find 


rd 
a ~ 


Zleie(s)|* = trlepeig) = =. rep. 
8 


We then find that the y,, can be brought into the form y,, = B{/B;, where 
the B, are positive real numbers (take, for example, RF = y,,). On re- 
placing the original correspondences I, by 8,I, we find that the new e'ki 
is actually conjugate to the new bas Our representations f), h,- °° are 


accordingly thrown into unitary form. 


B. ExtTENSION OF THE THEORY AND PHYSICAL APPLICATIONS 


§ 8. The Characters of the Symmetric Group and 
Equivalence Degeneracy in Quantum Mechanics 


The notation employed in this section is as follows: w= ay 
is the symmetric permutation group of f things, t = p = (z) 
the corresponding algebra, e a (primitive) idempotent element 
of p, p = te the (irreducible) invariant sub-space of t generated 
by e, § the representation induced in p» by the regular repre- 
sentation, g the dimensionality of p and b, x the character of 
h, ¢ that element of the set e’, ¢’’, - - + (7.14) to which the irre- 
ducible » belongs; ‘8 the corresponding symmetry class of 
tensors of order f, consisting of all tensors of the form @F, © 
the representation’of the algebra }' of symmetric transformations 
(and therefore of the linear group ¢) which 1s induced in % by 2 
itself. When further differentiation is necessary, we also denote 
this § by (x) or %,(x). In case the considerations are valid 
for an arbitrary finite group 7, A denotes the order of m7 (= f! 
for 7). 


Determination of the Group Characters. 


We begin by calculating the character of the representation b. 
To this end we construct the trace of the linear correspondence 


x—-> y= ax (8.1) 
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of p on itself; the considerations of the previous section show 


it to be 
Sals)x(s) 
Now consider instead of (8.1) the projection 
x—> y = axe (8.2) 


of the total space t on p; it coincides with (8.1) within p and 
sends any element x of t into an element y of p. On choosing 
the co-ordinate system in r in such a way that the first g funda- 
mental vectors span the sub-space p, the last h — g rows of the 
matrix of (8.2) consist only of zeros; hence the trace of the 
projection (8.2) of the total group space is equal to the trace of 
the correspondence (8.1) inp. In terms of components equation 
(8.2) is 


y(s) = La(t)x(s)e(t’), (ts't! = 5) 
and the trace is therefore 


AX a(sje(t) 


where the inner sum 1s extended over the pairs ¢, t’ of elements 
of the group which satisfy the equation ¢tst’ = s, or explicitly, 


the trace 1s 
>» (a(t) De(s 4s) }, 
t 8 


Hence the character y of ) is given by 
x(t) = Le(s tts) 


or 


x(s) = Le(rs-r~). (8.3) 


r 


In particular, the dimensionality g of the representation (and 
the space ) is 


Resonance or Equivalence Degeneracy. 


The significance of our results for quantum mechanics, as 
first recognized by Wigner, is the following.? The complete 
reduction of the tensor space #/ into invariant sub-spaces §, 
implies a separation of the terms of the physical system J/, 
consisting of f equivalent individuals J (electrons), into sets of 
terms which no dynamical influence whatever can cause to 
enter into combination with each other. We have further seen 
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that the reduction of R/ into the $; parallels the complete 
reduction of the total group space rt of the symmetric permutation 
zroup 7 into invariant sub-spaces p,. Hence there is a system of 
terms associated with every irreducible representation h of +— 
which we denote simply as the term system x, using the 
character x of ) as a name for the system—and the multiplicity 
of this term system is the number m(x) of times that § occurs 
in the regular representation. This suffers a slight modification 
in case x < f, for we must then ignore all p; which are not con- 
tained in t, = La, But since fp, 1s both right- and left-invariant, 
all sub-spaces which are equivalent to an irreducible invariant 
p lying in Y) are also in fg. Hence the multiplicity of the term 
system y is m(x) or 0 according as that ¢ with which the character 
x is associated by {7.22) is in tg or not. From the physical 
standpoint, the only additional fact of interest obtained from 
the more extended theory built up on the assumption that the 
number field in which we are operating is algebraically closed 
is that then the multiplicity m/(x) is equal to the dimensionality 
g of the representation §. Furthermore, it is impossible to 
resolve this multiplicity by any physical means whatever, for 
corresponding terms in these various term systems remain in 
coincidence under all dynamical influences. 

We consider the resolution of terms in the case in which the 
interaction between the f individuals is expressed by a small 
perturbation energy AW, neglecting higher powers of the small 
parameter A. Assume for the moment that the energy levels 
E,, Es, - ° + of a single individual I are non-degenerate. On 
neglecting the perturbation J/ possesses energy terms of the type 


B=F,+ baet+-- ++ &;; (8.4) 


we first concern ourselves with such a term. Its multiplicity 
is f! and the corresponding co-ordinates in tensor space are the 
coefficients F(i,, ig, - - - is) whose indices are any permutation 
sof 1,2,--°-+,f. This coefficient F(1,7, ++ + 7,) is the component 
x(s) of the element 


x= F(1,2,--:, f) 
of the algebra (7). The separation of the term (8.4) is to a first 


approximation determined by the reduction of the correspon- 
dence 


F(tytg eras iy) = 2 a(tate aiid 1y | kik, * ky) F(RyRe —< ky) 


to diagonal form ; here the matrix of the coefficients a represents 
the energy and i, 3, ° °°, 17; Ry, Re, °° *, Ry are permutations 
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s, t of 1, 2,°++, f. This equation may therefore be written in 
the form 


t(s) = DYa(s, t) x(t). (8.5) 
The equation : 
a(ty ttt dys Ry ttt Ry) Salty +s ty; Ry s+ + ky) 
describing the symmetry of a, in which 
los fof 
s any fixcd permutation r, is expressed by 
a(sr, tr) = a(s, t) 


for the only coefficients in which we are here interested ; r is here 
considered as applied to the indices 1, 2,- ++, f themselves rather 
than the sub-indices. Hence a(s, t) depends only on st7?: 


a(s, t) = a(st™), 
and equation (8.5) may now be written in the abbreviated form 
(4) 58 =a (8.6) 
where a, x, © are the symmetry elements of the algebra (a) with 
components a(s), x(s), x«(s). 

On restricting ourselves to an invariant irreducible sub-space 
® of the system space WW the element x of (7) lies in the corre- 
sponding p. The g terms W,, W,, +--+, W, into which (8.4) 1s 
resolved by the perturbation and which belong to the term 
system x under consideration are, to the approximation involved 


in the perturbation theory, the characteristic numbers of the 


correspondence (8.6) of p on itself... The sum of these terms must 
therefore equal the trace of this correspondence, or 


Wit Wet-ss ++ Wi = SLals)x(s). (8.7) 


The sum of the squares of these terms, of their third powers, 
etc., are obtained by reiterating the correspondence (a), 1.e. 


Wi + Wht +++ +WE=Tals)x(s), (8.7) 
where the a,(s) are the components of the symmetry element 
at: 


Qo(s) == 1 or 0, according as s = | or + I, ) 


az41(S) = dia,(st™)a({). j oe) 


As soon as the “ exchange energies’’ a(s) are known we can 
apply this formula to calculate those of the terms arising from 
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(8.4) which are contained in the term system y; for this we need 
only to know the character y—it is not necessary to have an 
explicit expression for the idempotent generator e or the 
representation } of =. 

These considerations are immediately applicable only if we 
ignore the spin phenomena. If we take into account the per- 
turbation due to the interaction of the electrons before that 
due to the spin, as in the case of normal term order, the mere 
existence of spin implies that each of the energies E£; is at least 
two-fold. We shall later concern ourselves with the far-reaching 
modifications caused by the spin and by the Pauli exclusion 
principle, which enables us to discard the majority of possible 
terms. 

The unperturbed Jf will have, in addition to terms of the 
type (8.4), terms in which groups of two or more summands 
appear with the same indices. The multiplicity of the term 


Ey +hba tes +h Aathtsss+h=f) (89) 
with integral non-negative weights /, is but 
f! 
fifl--- ft 


The corresponding tensor coefficients x(s) are those obtained 
from 


(8.10) 


0 ee 22 ee a ) 
i iP 


by the permutations s of the f arguments. But a permutation 
p is without effect if it only permutes the first f, indices among 
themselves, the next f, among themselves, etc.; we may no 
longer distinguish between the permutations s and ps-—they 
must be considered as giving rise to but one component. Such 
permutations p constitute a group 7’ = m(f,, fo, - ° *) of order 
h’ = filf,! + + °, and two permutations s, ¢ are to be considered 
as the same if they are left-cquivalent with respect to this sub- 
group 7’, i.e. if s = ¢ (ps = t, where p is an element of 7’). The 
only elements x of the algebra (2) in which we arc now interested 
are those which satisfy the equation 


x(t) = x(s) when ¢ = s (mod. 7’) ; 
they constitute a linear sub-space t’ = t(7’) of dimensionality 
(8.10). More precisely, vt’ is a right-invariant sub-algebra, for 
ifs = ¢then also sr = tr. Again a(s, t) = a(st“1); further 
a(ps) = a(s), a(sp) = a(s) 


if p is in 7’, 
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We are now concerned with the correspondence x > Zin r’: 
a(s) = Dia(st“)x(t) (mod. 7’), (8.11) 
t 


6 9 


where the ‘‘ mod. 7’ ” indicates that both s and ¢ run through 
a complete set of elements of the group which are inequivalent 
mod. 7’. As x runs through 1’, ve generates a sub-space p’ of 1’ 
which is transformed into itself by the correspondence (8.11), 
and the reduction of this correspondence of p’ into diagonal 
form yields those terms arising from (8.9) and lying in the term 
system x. The trace of (8.11) in p’ 1s equal to the trace of the 
correspondence A,: x— 2 in t’ which ts obtained from (8.11) 
by replacing x by ve, 1.e. x(t) by 


Lair )elr) = Za(rje(r'). 


Hence 
tr(A.) = SY {a(st71) Se(r—1)}. 
s,¢mod. x/ r=s 
Since a(st~!) = a(rt“!) when r = s (mod. 7’), this trace may be 
written 
» pdalrtye(r-4). 
tmod.x2’ r 
Naturally this sum does not depend on which particular element 
t we have happened to choose from the set of group elements 
which are equivalent mod. 7’; hence on dropping the restriction 
on the range of ¢ the above sum 1s multiplied by the order h’ 
of 7’: 
1 aes 1 
tr(A,) = pear Neg) == ji XAls)x(S). (8.12) 
Here again x(s) is the character of ) as determined by (8.3). 
In particular, the dimensionality of p’, t.e. the number of terms 
in the system x arising from (8.9), is obtained by replacing the 
symmetry element a in (8.12) by the element a, defined by 


ao(s) == 1 or 0, according as s = J (mod. 7’) or not: 


this number 1s consequently 


(8.13) 


We express this result, the validity of which is not restricted 
to permutation groups, in the theorem : 

Let 7’ be a sub-group of 7 of order h’ and let » be a left-invariant 
sub-space of the group space t of m. Consider the elements x of 
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the algebra (a) which satisfy the condition x(s) = x(t), where s and 
t are any two elements of the group m which are left-equivalent 
mod. 7’; the elements of (m) which are of this type and which 
lie in ) constitute a linear sub-space whose dimensionality 1s given 
by (8.13), where x 1s the character of the regular representation in ). 

The sum of the terms is equal to the trace (8.12), and the 
sums of their powers are given by 


2 4,(S)x(S) 
Pa al 1 rae (8.14) 


The only way this result differs from (8.7’) is by the introduction 
of the denominator f,!f,! -- + and the fact that a,(s) is now defined 
by 
a, (Ss) = Da,(st™)a(t) (mod. 7’). 
t 


Degenerate Case. Denote the numerically different energy 
levels of the individual J by EF’, FE”, +++, and the multiplicity 
of E® by n,. We now distinguish between the various variables 
having the same “‘ principal quantum number ”’ p by an “ auxil- 
lary quantum number ”’ k, which assumes 7, values. An energy 
level of the type 


6 


Bop EU 4. ++ 4 BW (8.15) 
of the unperturbed total system J/ has the multiplicity 
filmy ng + + + ny, 


and the corresponding tensor coefficients are those obtained 
from those of type 
F(; 2 f 


tote 


by any permutation s of the f pairs (v|k) of arguments ; we write 
instead 


x(s|kiRkp - + + Rs) or briefly x(s|k). 
Similarly the coefficients of the energy matrix are denoted by 
a(s|kyRs s+ + Rys tlle + + + Ly) = (stk; J). 


The energy levels W arising from (8.15) by the perturbation 
and lying in the term system x are, to a first approximation, 
determined by 

LW = YL a,(s\k; k)x(s), (8.16) 


(k) 8 
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where a,(s|k; 1) = 1 or 0 according as s = 1, k = or not, and 
the composition is defined by 


ar (s|k; l) = Ya,(st|k; m)a(t|m; I). (8.17) 
t,(m) 
If the unperturbed energy level is of the form 
eos + f° Rh" + uD So my (f’ + - tL st tee ee ay) 


the tensor coefficients in which we are interested are those ob- 
tained from 


1 1 S Oo ward ees 
at bt aa ay Ro tt lot . 
ta i 


Let exactly f, of the auxiliary quantum numbers ,,(v = 1, 
-++ f’) have a certain value k,, fo a different value kg, etc. ; 
fitfet:::=f', and let f;, f6,° °° have the analogous 
meaning for the quantum numbers k,,(v = 1,---+, f’’) associated 
with the principal quantum number 2, etc. Then those per- 
mutations p which leave the above tensor coefficient unchanged 
constitute a certain sub-group 7,, depending on the distribution 
of auxiliary quantum numbers k, of the group 7’ introduced in 
the non-degenerate case above; the order of a, is [k] = f;!f,! 
sos fills + + a(sik; 1) isunchanged when s is multiplied on the 
left by an element of 7, and on the right by an element of 7. 
The formula (8.16) now becomes 

SW = SI — yals|k; k)x(s)| (8.18) 

k | [F] 8 f 


ao(s|k; 1) == 1 or 0 according as k=/ and s = I (mod. z7,) 
or not, and in the composition rule (8.17) we first sum with 
respect to t mod. 7,, and then over the various possibilities 


m = (My, My, ° °°; My, + °°3 °° *). 


In every case we obtain explicit expressions for the sums of 
the various powers of the perturbed energy levels in terms of 
the character x of the term system under consideration and the 
exchange energies a(s). 


§ 9. Relation between the Characters of the Symmetric 
Permutation -and Affine Groups 


The thorough correspondence existing between the repre- 
sentations of the symmetric permutation group ay and the 
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representations of order f of the linear group ¢ must lead to 
a simple relation between the corresponding characters. In 
dealing with the linear group it suffices to consider only the 
‘“ principal transformations ”’ 


Ki );,%, (T= 1, 2,°+ +n) (9.1) 


of the vector space fi == W,, for any linear transformation is 
conjugate within ¢ to a principal transformation—except for 
those cases in which two or more of the characteristic numbers 
€, coincide. Furthermore, if we restrict ourselves ab initio to 
the unitary group u—the one in which we are interested in 
physics—the result 1s valid without exception and the ¢€, are 
complex numbers of unit absolute value. The problem here 
proposed is identical with that of investigating the distribution 
of the terms of // among the various term systems y in the 
absence of interaction between the various individuals and when 
the single system J is non-degenerate, for on choosing a Heisen- 
berg co-ordinate system x; in the system space of J (i.e. one in 
which the operator representing the energy of J is in diagonal 
form) the variable x, assumes the multiplicative factor e(— 4 


h 


in time ¢. 

We denote the characteristic * of the representation of 
the linear group whose substratum consists of all tensors of the 
form @F by X(S) or X(e,, €3, ° + *, €,) where the element S of ¢ is 
the principal transformation (9.1). The €, are to be considered 
as n independent variables. The transformation of tensor space 
associated with (9.1) consists in multiplying the coefficient 
F(t, ta, ** *, ty) of the tensor Ff by €; °€; ° ++ &y. The sum of 
all these multipliers, extended over all linearly independent 
coefficients of a general tensor of the form F’ = éF, is the desired 
characteristic. A component in which f, of the arguments 7 are 
equal to 1, fg are equal to 2,- - » is multiplied by ef1- e%- - + ef, 
But the number of linearly independent components of Ff” of 
this type is, by equation (8.13), 


(9.2) 


here y is the character of the representation of a;, the sum 
being extended over all elements s of the group a’ = (fi, fe, ° °°) 
which permutes the first f, numerals among themsclves, the next 
f, among themselves, etc. That this number (9.2) depends only 


* We prefer, for the sake of clarity, hereafter to employ the word 
“‘ characteristic ’’ for continuous and ‘‘ character ’’ for finite groups. 
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on the character x 1s a fact of greatest importance for our present 
considerations. ‘The result is ° 
efi of: e 8 \ 
Ken e, J = 2 {A yy 9.3 
( 1, £2 ) ae fil fe! Se 2X ) ( ) 
where the inner sum is extended over all the elements s of 
m(f,, fo °° *). We denote the value of the character y for an 
element s belonging to the class £ of conjugate elements of 
a, by x(£) ; our formula may then be written 


e e e k) 
oe 2 x) eA rg, ee (9.4) 
where cy, . . . (k) is the number of elements of a(f,, fe, « °°) 


belonging to the class f. This number can be evaluated in an 
elementary manner. 


Distribution of Permutations in Classes. 


Any permutation s is a product of cycles, no two of which 
contain a common numeral. The 5-term cycle (1 3 7 2 4) 1s 
a permutation which sends 1 into 3, 3 into 7, 7 into 2, 2 into 4, 
and 4 into 1 again; writing these 5 numerals at equidistant 
intervals on the rim of a wheel, this permutation may be con- 
sidered as the rotation of the wheel about the angle 27 5. Given 
any permutation, for example 


123456789 
nn en (9.5) 
347198265, 


the cycles may be separated out by first determining the number 
(3) into which 1 is transformed, then the number (7) into which 
3 is transformed, etc., until a number is obtained which has 
already appeared in the cycle; this number can, of course, 
only be 1. After separating out the first cycle the remaining 
numbers can be handled in the same way, and the process may 
be continued until the desired result is obtained. The per- 
mutation (9.5) is, in terms of its 3 cycles, 


(1 37 2 4) (5 9) (6 8). (9.6) 


The reduction of an arbitrary permutation into its cycles is 
obviously unique. This way of writing the permutation enables 
us to tell at a glance whether two given permutations are con- 
jugate in a, or not, for an element conjugate to (9.6) is obtained 
by replacing the numbers 1, 2, 3, 4, - +--+ by the same numbers 
in any order. The class £ to which an element s belongs is thus 
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determined entirely by the number of cycles and the number 
of integers they contain; in particular, any permutation s and 
its inverse s~! belong to the same class. We denote the class 
£ whose elements s consist of 7, cycles with one numeral, 7, with 
two, 73 with three, «+ + by (217973 + + +) and write x(f) = y(t, 22° °°); 
naturally 


li, + 2i,+34,+-:'=/f. (9.7) 


The number K of classes is the number of solutions of (9.7) with 
non-negative integers 2, 19, t3, °° °. 
The number of elements in the class f = (2,292, + + *) Is 


n(f) = erie enrre (9.8) 


Taal Qte tl Baal + 


To show this we write the f integers 1, 2,°- +. fin any of the 
f! possible orders and divide off each of the first 7, integers by 
parentheses, then divide off the next 27, in groups of 2, the next 
32, In groups of 3,° ++. The symbol so obtained is to be inter- 
preted as the expression of permutation in terms of its cycles. 
Each of the f! possible arrangements so obtained leads to a definite 
element s of the class f, and all such elements must be included. 
We must now investigate how often the same s occurs among these 
f!. Now the 5-term cycle (137 2 4) can also be read as (3 7 2 4 1), 
(72413), ete.: the particular integer with which we begin 1s 
immaterial; such a cycle will occur five times. Hence those 
I 2'2 3's + + + arrangements which differ only by a cyclic per- 
mutation of the numerals in each cycle are all associated with 
the same element s. Furthermore, the 7, 1-term cycles may be 
written down in any order, the 1, 2-term ones in any order, etc., 
and these 2,!7,! +--+ arrangements all lead to the same element s. 
Hence each element occurs exactly 1 7,!2'27,! + + + times, and the 
total number of elements in the class is accordingly given by 
(9.8), 

We must also determine the number of elements of £ which 
are contained in the sub-group a(f,, fo, °°). For this purpose 
we divide the numbers from 1 to f in sections of lengths fi, 
fe, + + + and consider only those permutations s which permute 
the numbers of the first section among themselves, the numbers 
of the second among themselves, etc. On dividing s into cycles 
as in the above some of the cycles will be contained in the first 
section, i.e. will consist only of numerals belonging to the first 
section, some will be contained in the second section, etc., and 
no cycle will consist of numerals belonging to different sections. 
Denoting the number of 1-term cycles contained in the first 
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section by 7,,, the number of 2-term cycles in this section by 
149, etc., whence necessarily 


lity) oi 2219 = 3243 ae a = fi, 


the number of permutations of 1, 2, °° +, f, satisfying this 
requirement is, by (9.8), 


ii! ] 
ieee eee (9.9) 


Proceeding analogously for the 2°4, 3° etc., sections, the number 
of permutations in m(f,f,° + *) satisfying all our requirements is 
given by the product of all numbers of the form (9.9) for the 
various sections. But such an element is a member of the 
class f = (2,2, : + +) 1f and only if 


Dita == 14, Ditag = Ie 15 (9.10) 
hence 
1 fl 
offs ss (f) = 112": et S{ | lam i } 
(2) a 


where the sum is extended over the various solutions of equations 
(9.10) and 
LVvly = fr, LV ly = fo, oe 


The inner sum in (9.4) is accordingly 


] Etta EP aa 
pet sd tte een 


aL e 1a9: 
(t) a 


the only restriction on the sum being the conditions (9.10). Let 
ous ae a . "+ En, 
Og = E] + 2 te + + + &, 


Our results can be expressed entirely in terms of these sums of 
powers, for by the multinomial theorem 


Etat 1 " 
7 ores & 
let: ty: 

Oo . 


(1) 


Ey 7" 02 1 i, 
pf = yee 
1x2: 19: 
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where the variables 7,,,142, °° *°, over which the sum is extended, 
are subject to the restrictions (9.10). We thus finally obtain 
the simple formula 


x(t, 15 ea ‘Jot! ay ea ; 

€y, 9, ° ° *, En) = 1, 91 “7 

X( ty “2s sea) Sion ° —_— (9.11) 
f 


We have so far made use only of the elementary connection 
between the groups 7 andc. If we now introduce the assumption 
that the number field over which our algebras are defined 1s 
algebraically closed, and is in particular the continuum of all 
complex numbers, the primitive characters of the finite group 7 
have the orthogonality properties 


Am x(Hx(F 4) =h, 


Ene) = 0 x +x). 
Furthermore, the number of primitive characters 1s equal to 
the number K of classes. The above relations assert that the 
matrix of the y(f), where y runs through the entire set of primitive 
characters and fF all classes, has as its reciprocal the matrix 


7+ n(f)x(E), 


Hence we also have 


This is, in fact, merely an alternative form of the completeness 
theorem. In dealing with the symmetric permutation group zy, 
{1 = fand the order ish = f!. 

On multiplying the expression (9.11) for the primitive 
character X by x(2yZ2 °° +) and summing over all the primitive 
characters x of a,, we obtain, with the aid of the relations 
derived above, the important formula 


ovay s+ = DS y(iyig + + +)X(&, &2, * + + En) (9.12) 
X 


where yx and X are the characters of corresponding irreducible 
representations of a, and ¢,. 


332 THE SYMMETRIC PERMUTATION GROUP 


§ 10. Direct Product. Sub-groups 


Programme. 

If two atoms or ions with fi, f, electrons, respectively, come 
together to form a molecule we may to a first approximation 
neglect the interaction between the two atoms so long as the 
distance between them ts relatively large. In this approximation 
the two kinds of electrons are dynamically different, for the 
electrons of each atom are influenced only by the nucleus and 
the remaining electrons of the same atom. The symmetry is 
therefore described by the sub-group 7’ of the symmetric group 
a == m, of f= f, + f, things in which the first f, and the last f, 
things are permuted among themselves. A similar situation 
arises when three or more atoms come together to form a 
molecule. These considerations immediately suggest the follow- 
ing problems. 

I. The theory developed in §§ 2-4 is to be extended to the 
case in which the symmetric permutation group is replaced 
by any permutation group 7’. Naturally the definition of a 
symmetric transformation in tensor space 1s to be adapted to 
the new situation: we require only that the coefficients 
Q(t; * + + tf; Ry + + + Ry) of (1.2) remain unchanged under an 
arbitrary permutation belonging to the group 7’ of the sub-indices 
1, 2,---,f. We say that these transformations are symmetric 
with respect to 7’; they constitute an algebra 2%” which 1s 
obviously more extensive than 2.—This question is immediately 
settled by the remark that all our previous deductions are valid 
for an arbitrary permutation group 7’. Here 7’ is considered as 
an independent group rather than as a sub-group of the sym- 
metric group. 

II. Let the set of integers from 1 to f be divided into two 
or more sub-sets. We consider, as an example, the case of 
two sub-sets: the ‘ red ’’ numerals from 1 to f, and the “ green ”’ 
ones from ltofg; ff t+f,=f. Let’ consist of all permutations 
of the red among themselves and the green among themselves. 
Hence a permutation s’ = (s,, Sg) of 7’ consists of a permutation 
s, of the f, red numerals and a permutation s, of the green ones ; 
wm’ is the direct product m, X wm, of the symmetric group 7, of f, 
and mw, of f, things. Or conversely, this direct product-—the 
abstract definition of which has nothing to do with the group 
of permutations of f things—may be considered as a sub-group 
ma of the symmetric group of f= f/f, + f/f. things on arranging 
the sets of numerals, on which permutations of 7, 7, act, onc 
after the other to form a single set. But here we are interested 
in the following problem (which can be proposed for arbitrary 
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finite groups): to discuss the properties of a group 7, X 7, 
which is the direct product of two finite groups 7, 79. 

III. In order to discuss the structure of molecules we must 
eventually take into account the interaction between the various 
atoms or ions contained in the molecule. This means that we 
must finally return from the sub-group 7’ to the full symmetric 
group m7, so we must examine the relations existing between the 
group 7 and its sub-group 7’. Here again the problem is not 
restricted to permutation groups. 


Direct Product. 


Let 7,, 7, be two finite groups of orders f,, fg respectively. 
The elements of the direct product w= 7m, X 7, are the pairs 
(s,, S,) consisting of an element s, of m7, and an element sy of 
av, An element of the algebra of mw is accordingly a function 
x(S,, Sg), and it follows from this that the algebra of am is the 
product of the algebras (7,) and (7,) : 


(77) == (77) X (79) 


in the sense of the X-multiplication of vector spaces introduced 
in II, §10. An element x,: %,(s,) of (7,) and an element x,: 
Xo(Sg) of (72) yield the element x% = x, X xX, of (7), whose com- 
ponents are given by 


X(Sy, Sa) == %4(Sy) + X(SQ). 


Indeed, given any two algebras p,, po, their direct product 
p == py X p,. can be constructed and multiplication in p defined by 


(@; X @q)(Dy X Oy) = (ad, X ab.) 


whether they are group algebras or not. 

If p, is a linear sub-space of t, = p, (a == 1, 2), an element 
X:X(Sy, So) of (mw) is in p =p, X p. if and only if it belongs to 
p, when considered as a function of s,, holding sg fixed, and to 
p. when s, 1s held fixed; indeed, any element of this kind can 
be expressed as a linear combination of products of the form 
a, X ag, where a, is in p, and a, in Py. If p.(a = 1, 2) is an 
invariant sub-space of ¥,, generated by the idempotent element 
e, and the representation space of the representation h, of p, 
induced in p, by the regular representation, then ) is also 
invariant, has as generating idempotent clement e = e, X é, 
and is the substratum of the representation }, X ). of p. It is 
evident that the equivalences p, ~ pj, Dg ~ Pz imply the equi- 
valence Py X Pe ~ Py X Po. 

Suppose the two p, considered above are also irreducible 
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with respect to their algebras p,; the question then arises as 
to whether p, X }, 1s irreducible (with respect to p) and whether 
pb =p, X pe, is equivalent to p’ = p; x pe (p, irreducible) only 
if Pi ~ $1, Pe~ Po. p and p’ are inequivalent if exe’ = 0 
identically in x, i.e. if the sub-space consisting of elements of 
character (e, e’) contains only the element 0; here e = e, X ég, 
e’ =e, X es. Now the formula 


(@, X €)(x, X %e)(ey X €g) = 05% 10, X CgX oly 


shows immediately that the sub-space (e, e’) is the direct product 
of the two sub-spaces (e,, e;) and (é, ¢,), and can consist merely 
of 0 only if one of these two sub-spaces consists merely of 0, 
i.e. only if p, is inequivalent to p, or p, is inequivalent to po. 
Our second question is thus answered in the afirmative—regard- 
less of the nature of the field over which the algebras are defined. 

The first question is answered in the affirmative in III, § 9, 
for the only case of physical interest, i.e. that in which the field 
is algebraically closed. If we are more interested in the re- 
duction of the algebra than in the representations we can argue 
as follows. The algebra of elements of character (e, e) is the 
direct product of the field (division algebra) ®, of elements of 
character (e,, €,) in p, and the field ®, of character (eo, ¢,) in pp. 
Assuming the original field is algebraically closed, all elements 
of ®, are multiples of e, and consequently all elements of p 
with character (e, e) are multiples of e. This proves the irre- 
ducibility of p, xX p,. If, however, the original field over which 
the algebras are defined is not algebraically closed our assertion 
is correct only if the direct product ®, x ®, of the two fields 
is again a field, and this is by no means always the case. But 
in any case the question concerning the nature of the direct 
product of algebras is, as in the question concerning the structure 
of an algebra in § 7, reduced to the analogous problem for fields 
(division algebras). 

Again taking the fundamental field to be the continuum of 
all complex numbers, the complete reduction 


y= Spy, ee Ze 
t 


into irreducible invariant sub-spaces p, has as a consequence, 
in accordance with the above, the reduction of t = rt, X f, into 
invariant irreducible sub-spaces p® x p). 

Sub-groups. 


Let zr’ be a sub-group of the given finite group 7. An element 
x’ of the algebra t’ = p’ = (m’) of a’ consists of components %’(s’) 
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associated with the various elements s’ of mw’. However, such 
an element can, and in the following will, at the same time be 
considered as an element of the algebra p = (7); we need only 
to define the components x’(s) associated with elements s of 7 
which are not contained in 7’ as zero. This disturbs in no way 
the addition and multiplication of elements of (7’) with each 
other or with arbitrary numbers of the field. An element x of 
(7) ‘‘ belongs’ to mw’ or ‘lies’ in (m’) if and only if all com- 
ponents x(s) associated with elements s of the group that are 
not in 7m’ vanish. 

An irreducible invariant sub-space p’ of vt’ is generated by a 
primitive idempotent clement e’ and is the substratum of a 
representation f’ of w’ induced in p’ by the regular representation. 
On reducing the modulus 1 of 7’ into independent primitive 
idempotent elements 


, 
l= Set: (10.1) 
t=] 


a certain number, say ¢’, of elements e, will appear which are 
equivalent to e’; the sub-spaces p; which they generate are all 
equivalent to p’ and the regular representation of m’ contains }’ 
g times. Equivalent summands are added together into 
such partial sums. Considered as an element of the total 
algebra p = (7) e’ 1s, however, in general reducible into inde- 
pendent primitive idempotent elements : 


b 
e= SYe,t-:: (10.2) 
a=l 
Here again equivalent summands on the right are collected 
together into partial sums; let the e, in the first such partial 
sum generate the representation ) of w—we shall in the following 
be interested only in these. Let the sub-space p with the 
generating unit e be a representative of the sub-spaces p, gener- 
ated by the e,. The elements of (a) of the form xe’ constitute 
an invariant sub-space <p’> which 1s the substratum of a re- 
presentation \h’> of a induced in p’ by the regular representation 
of m. Our formula asserts that on reducing «<’> into its irre- 
ducible constituents } occurs exactly 5 times. 

In order to obtain a simple characterization of the elements 
of <p’) we divide the elements of the group z into sets of group 
elements which are equivalent mod. 7’; the uw such class 
consists of the group elements o,s’, where s’ runs through the 
sub-group 7’. Anelement x of the algebra (7) has as components 
x(o,5°); the numbers x(o,5’) may, for fixed uu, be considered as 
the components of an element x), of the algebra (m’), so that x 
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may be considered as the set of elements x, belonging to the 
algebra (7’). The formula y = xe’ then becomes y,, = x,e’ in 
(7’\: hence x belongs to <p’> if and only if all the partial 
elements x, lie in p’. The correspondence 


X—> y= ax 
may then be written 


or 
, 4 , 
Mu DAyyXy 
v 


where @,,, is the element of the algebra (7’) defined by 


Ayy(S') = a(o,s’o,"). 

The representation <h’> may therefore be constructed as follows : 
first associate with the element a of (7) the matrix |la,,||, the 
coefficients of which are elements of the algebra (7’) instead of 
numbers, and then replace each a,, by the matrix A_,, associated 
with it in the representation h’ of a’. 

As we have seen 1n the earlier part of the present chapter, 
the representations are obtained with the aid of a double Peirce 
decomposition ; we therefore consider the elements x = e’xe’ of 
character (e’, e’). The idempotent elements e,, - + + appearing 
in (10.2) are of this character, and such an element x may be 
expressed in terms of its components 


b 
x= DD eyX@g +e. (10.3) 
1 


We now repeat the analysis of §7 for our more restricted set 
of elements: let I, be a one-to-one similarity correspondence 
of p, on p and let the element into which e, is sent by the corre- 
spondence I°,I'g* be denoted by e,,*. If, as we now assume, 
the field over which the algebras are defined is algebraically 
closed e,xeé, 18 necessarily a multiple x, of e.g. We then obtain 
instead of (10.3) the reduction 


X= DXaplap f° ', (10.4) 
(where the x,, are numbers) and the representations 
%—> |\Xapll, °° (10.4") 


* Here, as in § 7, but in contrast with our usual notation, the product of 
two or more correspondences I is to be read from left to right. 
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Now if in particular x is in (m’) then x = e’xe’ is a numerical 
multiple of (10.2), and the matrix ||x,|| associated with such an 
element is a multiple of the unit matrix.—The degree of the 
secular equation, the solutions of which determine the character- 
istic numbers, is thus decreased from g to b for an element x 
of character (e’, e’). We now proceed to examine the cause of 
this. 

Let I’;~! be a one-to-one similar correspondence of p’ on p, 
(c= 1, 2,-+-, g’), and let the element into which it sends e’ 
be b;. On considering an arbitrary element x of the algebra 
of 7 as the set x,, we see that the correspondence 


xe’ —> xb; 


is a one-to-one reciprocal and similar mapping of <p’> on <p,’>: 
the projection I’; of p,; on p gives rise to such a projection of 
<p; > on <p>. This projection associates with the reduction 
of <p’> into irreducible invariant sub-spaces a reduction of the 
same kind of the sub-space <p;>; corresponding to equation 


(10.2) we obtain the equations 


b 
ep= Lea tot. (10°5) 

a=l 
On combining (10.1) and (10.5) we obtain a reduction of the 
modulus 1 intoindependent primitive idempotent elements of 
(7). Now consider the partial sums Y'e, of 1 and their reductions 


t 
(10.5) as written one above the other. Each row 1s then as- 
sociated with a definite representation ’ of 7’ and each column 
on the right-hand side, the terms of which are sums of the form 
>» dai, 18 associated with a definite representation § of mw. We 


t % 


now collect together all the summands ey occurring in the first 
column on the right, 1.e. all those elements ey which are equivalent 
toe. The set of indices J is then broken up into sub-sets, each 
of which is associated with one of the inequivalent irreducible 
representations f’,--- of mw’; the first of these sub-sets, which is 
associated with h’, consists of the bg’ double indices a2. 

Let the similarity projection II,~! of p; on py, send e; 
into e;.,. If x’ is an element of (7’) the equation 


x! = Yeix'e, + 
i,k 
yields the reduction 
x’ = Xe Ci + Be oe (10.6) 


338 THE SYMMETRIC PERMUTATION GROUP 


with numerical coefficients x, and x’ —> ||x,,|| is the representa- 
tion ’. (The partial sums should preferably be written one 
above the other rather than horizontally.) I; may be con- 
sidered as a similarity transformation of <p,> on <p’> and 
therefore contains a transformation of the same type of p,, on 
p,; I;I, then provides us with a similarity correspondence 
of p,, on p. Let I’ be a fixed one-to-one similarity correspond- 
ence of py on p and let the similarity correspondence I',I'x' of 
by on Px send e; into ey, x. We may take the correspondence 
[Tas I’; for the index J = ai, and similarly for the remaining 
sub-sets. On applying the correspondence [i I-' =I (IyI,)7 
to equation (10.5) we find 


b 
Ci-k = 2 Oni; ok + my Es Re (10.7) 
The equation 
x= Yesxexg bot += Denes: etc (10.8) 
JK J, K 
then determines the representations 


bho: x |lxzell; + °° 


By (10.6) and (10.7) the matrix associated with an element x 
of (m’) is 


Xai; pk = Onp Xin, XK = O 


where the two indices J and A belong to different sub-sets. 
But this means that on restricting 7 to 7’ the representation h 
is reducible into the irreducible representations ’, - -- of 7’, 
bh’ appearing exactly b times. We have thus obtained a con- 
structive proof of the theorem °: 

First Reciprocity Theorem (for arbitrary groups). If <h’> 
contains the representation fh of m exactly b times, then on restrict- 
ing the group m to 7’, ) contains the representation fy of m’ exactly 
b times. 

If the sub-group 7’ consists merely of the unit element 1 
this theorem reduces to our previous result: the number of 
times an irreducible representation appears in the regular 
representation is equal to its dimensionality. Both the com- 
plete theorem and this special case depend on the assumption 


that the field over which the algebra is defined is algebraically 
closed. 


Connection with Symmetry Classes of Tensors. 


We apply the results of our investigation III to the symmetric 
group 7 and make use of the correlation described in I above for 
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7 as well as for its sub-group 7’. An irreducible sub-space p 
of (7) determines a symmetry class J = ap of tensors; let the 
corresponding representations of w and the linear group c be 
h and §), respectively. An irreducible invariant sub-space p’ of 
(7’) determines a symmetry class §$’ of tensors which is invariant 
with respect to the more extensive algebra 2” of all transforma- 
tions which are symmetric with respect to 7’; as such §’ is 
irreducible. If e’ is the generating unit of p’, 8’ consists of all 
tensors of the form @’F; but this is equivalent to saying that 
the symmetry element F of (7) belongs to <p’>. Hence the 
reduction of $8’ into irreducible invariant sub-spaces with respect 
to the more restricted algebra 2 parallels the reduction of <p’). 
Let h’ be that representation of a’ induced in p’ by the regular 
representation of 7’ and §’ that representation of ¢ whose sub- 
stratum consists of all tensors in the symmetry class $8’. Hence 
our gencral theorem—or rather its converse, the truth of which 
follows immediately from the theorem itself—allows us to state 
the 

Second Reciprocity Theorem (applicable only to permutation 
groups) If the irreducible representation of mw contains the 
irreducible representation fy of m’ exactly b times when considered 
as a representation of the sub-group 7m’, then conversely the repre- 
sentation §' of ¢ contains the representation S) exactly b times. 

Finally we take 7’ as 7, X am, as in step II above. p’ can 
then always be taken in the form p, X p,, and the irreducible 
invariant sub-space p, of (7,) determines a symmetry class 9, 
of tensors of order f, (# = 1, 2). Denote the corresponding 
representations of 7, and c by §, and ),. The SR’ associated 
with p’ = p, X p, consists of all tensors of order f=/f, + f, 
which satisfy the symmetry conditions of ‘8, with respect to 
their first f, indices and the symmetry conditions of 98, with 
respect to the last fg; 1.e. 3’ = B, xk B,. Our theorem now 
becomes : 

Third Reciprocity Theorem (for permutation groups). IPf the 
trreducible representation of m contains, on restricting m to the 
Sub-group am’ = 7, X 2, the representation hy X hy of mw’ exactly 
b times (h, an irreducible representation of m,), then conversely the 


representation {), X ), of ¢ contains the representation § exactly b 
times. 


§ 11. Perturbation Theory for the Construction of 
Molecules 


We return to the investigation of the physical system Jf 
consisting of f electrons or equivalent individuals J. As long 
as we disregard the interaction between the individuals we obtain, 
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among others, f!-fold energy levels E of the type (8.4). We 
consider in particular the case in which the F, are different 
simple levels of the individual J. In order to follow the resolu- 
tion of E, due to the mutual interactions of the electrons, to 
the approximation which characterizes the perturbation theory, 
we must first determine the elements a of the algebra of a, the 
components a(s) of which are the exchange energies, and trans- 
form the matrices corresponding to a in the various irreducible 
representations of mw into diagonal form by an appropriate 
change of co-ordinates (§ 8). We now assume that the most 
important of the exchange energies a(s) are those belonging to 
the permutations s of a certain sub-group 7’ of a; all others 
shall be small in comparison with them (‘‘ quantities of 274 
order’’). Our procedure 1s divided into two steps, corresponding 
to the investigation of sub-groups carried out in the preceding 
section. Let a’ denote that element of the algebra (m’) which is 
defined by 


a'(s) = a(s) or 0 


according as s is an element of the sub-group 7’ or not, and let 
the matrices associated with a’ in the irreducible representations 
bh’ of a’ be referred to principal axes; then 


eae, = 0 (i+k), ea'e, = W;-e, 


t° 


The characteristic numbers W, are the energy levels on neglecting 
perturbations of 2"¢ order; we assume they are all different. 
In order to examine the further resolution of such a term 
W = W, under the influence of the 2"4 order perturbation we 
need, in accordance with the perturbation theory, to consider 
only that part 

a* = eae’ 


of a which is of character (e’, e’), where we have written e’ in 
place of e,. This term yields b terms W, belonging to the 
symmetry class x associated with the irreducible representation 
of a, the values of which are the characteristic numbers of 
the matrix |la,,\| associated with the element a* = e’ae’ as in 
(10.4°). All the algebraic elements appearing in these con- 
siderations are real and the corresponding matrices are con- 
sequently Hermitian. 

We apply the procedure to the process by which molecules 
are constructed from their constituent atoms.4° We consider 
as an example two atoms joining to form a molecule, the one 
containing f, and the other f, electrons; f= f, +f, We 
consider the two nuclei as held fixed at a distance d@ apart, which 
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is large compared with the linear dimensions of the atoms, and 
attempt to determine their interaction energy as a function of d. 
The sub-group 7’ = 7, X 7, consists of all permutations which 
send no electron of one atom over into the other; we have seen 
in § 10 that we may then take the primitive idempotent elements 
e, =e’ of the algebra (m’) in the form e, X é, where @, é, are 
in (77), (72) respectively. On neglecting the interaction between 
the electrons of the one and the electrons of the other atom we 
obtain an energy term W which belongs to definite symmetry 
states of both atoms. e’ generates a sub-space ’ = 8, x P, 
(of the tensor space §t/) which is invariant under all symmetric 
transformations; that the state of the molecule is described 
by a tensor of this sub-space §§ means that the state of the first 
atom is in $8, and that of the second in §8,. Hence on reducing 
33’ in parallel with the reduction of <¢p’> into irreducible 1n- 
variant sub-spaces : 


Zee De hyp eS NOs A RO es, 


there occur b sub-spaces $8) which are equivalent to one another 
and which belong to a certain representation of 7 or to a certain 
symmetry class of terms of the total system. The procedure 
sketched in the preceding paragraph thus leads to b terms which 
(1) arise, due to the perturbation, from the given unperturbed 
term (8.4) and (2) which belong to certain given symmetry 
states y,, x2 and y of the two atoms and the molecule. This 
reduction of the total system space #t/ into sub-spaces, each of 
which corresponds to a definite symmetry state of each of the 
atoms taken separately and of the molecule, naturally is not 
bound up with the approximate calculation of levels with the 
aid of perturbation theory; the connection between the two 
appears only on taking the above condition (1) into account— 
the very essence of which implies the assumption of small per- 
turbations. This somewhat sketchy account of the situation 
arising from an unperturbed term of the type (8.4), in which 
the energies F; of the individual J are non-degenerate, can readily 
be extended to cover other more complicated types of unper- 
turbed terms. These other cases are of course of much greater 
physical interest, for we have seen in Chapter IV that all atomic 
energy levels, except S-terms, are necessarily degenerate.!! 

The fact that the total system may be in any one of several 
symmetry states $8, corresponding to different energy levels 
(i.e. binding energies), when the symmetry states of the com- 
ponent atoms are given is of greatest importance. We shall 
later show that these possibilities, finite in number, coincide with 
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those predicted by the empirical theory of the valence bond, and 
that consequently the symmetry state of an atom is that which 
chemists call its valence state. The situation thus arising cannot 
be described adequately in terms of classical models—e.g. the 
fact that the two H atoms constituting an H, molecule can be- 
have in such a way that the state of the molecule may lie in 
either the space of symmetric or anti-symmetric tensors of 
order 2; only the first case can lead to an attraction which will 
bind the atoms together—the second always results in a re- 
pulsion.!# The binding energy between two ions of total residual 
charges @,, é, is naturally due mainly to the Coulomb potential 
€,e,/d (‘‘ ionic binding ”’ or ‘“‘ polar bond ’’), but the corresponding 
energy for two neutral atoms is due for the most part to the 
interaction of the ‘‘ exchange energies ’’ a(s) of the electrons of 
the two atoms (‘atomic binding’”’ or ‘‘ non-polar bond’’). 
This quantum-mechanical solution of the puzzle offered by the 
non-polar valence bond was first given by Ff. London and 
W. Hettler. 

The following points are to be taken into consideration in 
applying the theory of perturbations to the actual evaluations. 
On neglecting the interaction between the various electrons 
each is subject only to the attraction of the two nuclei; we 
should therefore perhaps begin with the characteristic numbers 
E, and the corresponding characteristic functions w,(xyz) of 
this one-electron problem. The first approximation should then 
be obtained by taking into account the repulsions between the 
electrons of each of the atoms separately, thus introducing a 
dynamical difference between the two kinds of electrons. This 
procedure is naturally significant only so long as the distance d 
between the atoms is large in comparison with their linear 
dimensions a. But then it 1s also reasonable to take as our 
0 approximation that in which each of the electrons is subject 
only to the attraction of its own nucleus (plus the closed shell 
of electrons which are not to be taken into explicit account in 
the calculations). Let this one-electron problem for the first 
atom have the characteristic values E; and characteristic func- 
tions #;, and let the corresponding quantities for the second 
atom be £;,, Jy. The fact that the J, and the g, together 
cannot constitute an orthogonal system—zindeed, they are not 
even linearly independent, for the %, alone constitute a complete 
orthogonal system—causes some difficulty. But if we break off 
the series of quantum states at a finite n—which can be chosen 
higher the larger the value of d/a under consideration—the 


finite set 
yp: b,, Po, PRS Vn Pr, bo, ee es Uy’ 
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of functions % constitute an almost orthogonal system; the 
fundamental metric form Gp», the coefficients of which are the 
scalar products 


Lin = (Bi, by) = jh b, dV 


(where 2 and & run through the primed as well as the un-primed 
indices), differs but little from the unit form. Indeed, an integral 
of the form (,, %,) is of order of magnitude e~#*, To show 
this we note that if the two centres of force are nuclei or closed 
cores with “unit” residual charge, the normal states of the 
atoms are given by 


where ” and r’ are the distances to the two cores. The integrand 
In 


l : 4 
Pr Pr) = —le" peueay 


is everywhere Se~%", This integral can readily be exactly 


evaluated on introducing bi-polar co-ordinates (r, r’, 6); the 
volume element is then 


») 
dy)’ == rr’ dr dr’ 
a 
and the range of integration is defined by 
y+tr2d—dsr—r <d. 


On introducing 


c+ 
A? 9 t , 
(py, yb) ace | [6° fe -\e~ dp dp 
—1 


aN 
te 


p* — ;)emdp = on +. Xd + ) 


For the f-electron problem we therefore start with the 
functions 


Plt, 7 +, ty) = LD yp,(xys) 
t 
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as approximations to the characteristic functions; in this 
product the co-ordinates are those of the f electrons and 7 runs 
through the values 7), 72, : +, zy, each of which is one of the primed 
or un-primed indices between 1’ and n’ or l and n. The funda- 
mental metric form G = Gy X Gp X +++: X Gy has as components 
the scalar products of (1), 72 ° + + 1,) with (k,, ky + + + ky) and 
the components of the energy H, the potential part of which ts 
obtained by adding together the potential energies resulting 
from the attractions and repulsions of the various electrons and 
the two cores, are the scalar products of #(1, - - - ty) with the 
vector Hgb(k, - + + ks) into which #(k, + + - Ry) is sent by the 
operator H. We consider the resolution of the unperturbed 
term 


| Oo ee cae er 0 a aaa aa O12) 


The components 


Gays P8475 Boe * Rh) and Faye 4p Rye tek), 11) 


in which the indices 2, Rk are permutations s, t, respectively, of 
1e++, ff, +--+, fo’, are of the form G(st7!) and H(st“!). We 
introduce the (real) elements G@ and HM with components G(s) 
and H(s). G and Hf are next replaced by G’ and H’ with com- 
ponents G(s) and H(s) if s is in m’ = am, X a, and 0 otherwise ; 
the justification for this lies in the fact’ that the components 
associated with an s which is not in 7’ are very small—they are 
of relative order e~*4/", @’ is in fact the modulus, whereas G 
is not; the procedure employed previously must therefore be 
modified in the following purely formal respect. On repeating 
the reasoning, keeping in mind the fact that G@ is no longer the 
modulus, we find as the secular equation for the determination 


of the b terms A = W, 
| AG xg — Hyp | ex); (11.2) 


in which 


e'Ge’ = SGxg Cap + °° *, 
ap 

e’He’ = SHagesp te 
ap 


in terms of the notation employed in the preceding section. 

This procedure is open to the criticism that whereas the 
second order perturbations between the electrons of the same 
atom are neglected, the interaction between the two atoms, which 
is considered to be of second order, is taken into account. The 
results are therefore inapplicable to the limit d/a— oo and can 
at most be applied successfully in cases in which d/a is consider- 
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ably larger than 1 but not too large. On the other hand, we 
could begin by assuming that the solution of the quantum 
problem for the individual atoms is already known. Let the 
function wW, of the co-ordinates of the first f, electrons be a 
characteristic function of the first atom corresponding to the 
energy term E, (so normalized that the integral of x5, is unity) ; 
it will belong to a certain simple symmetry state of the first 
atom, 1.e. there exists a certain real primitive idempotent element 
e, of (7,) such that é6,, = ~,. Similarly, let %, be a character- 
istic function of the second atom for the term ., having a 
corresponding property @,% = .. Neglecting the interaction 
between the atoms, p = W,.%, 1s a characteristic function of 
the molecule consisting of the two atoms and having the energy 
E=EH,+ hb, e' =e, X e, is a primitive idempotent element 
of the algebra of m’ = 7, X a, and # has the property 
e's = 

The functions s%, which are obtained from ™% by the totality of 
f! permutations s of its arguments, span a linear function space 
(R) of a finite number of dimensions—in which the s# are natur- 
ally neither linearly independent nor mutually orthogonal. 
The theory of perturbations requires us to find those functions 
¢@ of (RM) which are such that the orthogonal projection of Hd 
on () is proportional to ¢ itself; the factors of proportionality 
are then the values of the displaced terms, to a first approxima- 
tion. We must therefore evaluate the integrals G(s, t), f(s, t) of 


th-sh and tp: I1(sp) 
and solve the secular equation 
JAG(s, t) — H(s, t)| == 0. 
G and H depend only on t"'s :* 
Gis; CEs). TIS.) SACS). 


This is proved by the fact that the integral of f- 4 is unchanged 
on replacing yf, ¢ by ri, rd (ry an arbitrary permutation) ; //(sd) 
is equal to s//~ because of the symmetry of the operator H. 
Let @ and Hf again be the elements of (7) with components 
G(s), H(s). They satisfy the equations 


e’Ge’=G, e'He’=H 


* On comparing this with (11.1) it is to be remembered that there the 
permutations s and ¢ operate on the indices and not on the arguments ; hence 
the elements (11.1) are, in our present notation, 


G(t-1, s-), and) H(t“, s-). 
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and are therefore of character (e’, e’). Indeed, we have, for 
example, 


y= Ye'(r)- rd, whence H(sp) = Ye'(r) + H(srib), 
and on multiplying this latter by % and integrating we find 


H(s) = Ye'(r)H(sr) or H = He’. 


It then follows that also H = @’H whence, since e’ is real, 
HT = e’H and consequently H = e’He’ as asserted. 

The only non-vanishing elements of the matrix ||H;R|l, 
which corresponds to the element HA in the representation ), 
are (in the notation of § 10 with e, = e’) those contained in the 
square sub-matrix of length 6 in which the row and column 
indices J and K are of the form «1. We are thus led directly 
to the secular equation 


| AG ap ice Hap | a 0 


of b'" degree. (The most natural method of solving this equation 
consists in finding that linear transformation which sends the 
Hermitian form with coefficients G,g into the unit form and at 
the same time reduces ||H,,|| to diagonal form.) 2H,, is then 


the trace of the matrix belonging to H in the representation ), 
or 


SH,, == ZH(s)x(s). 


If in particular b = 1 the above symmetry system of the 
molecule contains but a single term arising from the unperturbed 
term /; its value is, in accordance with the equation derived 
above, given by 


LH (s)x(s) _ E + 4° H(s)x(s) 


ZGls)x(s) FG )x(s) iii 
The accent on the right-hand side indicates that these sums are 
to be extended over only those permutations s which do not 
belong to v7’. This formula (11.3) is due to F. London.® It 
will be shown later that in the case of diatomic molecules ) 
is always 1; we must expect, however, to find higher values of 
b in dealing with more complex molecules. The real difficulty 
from the physical standpoint naturally consists in getting in- 
formation concerning the exchange energics H(s). It is to be 
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noted, however, that we need only to concern ourselves with the 
sums 


r 


DH (rsr),  SG(rsr7}) 


over the various classes, for since x(s) is a class function all 
summands in (11.3) for elements in the same class — may be 
added together to give the above coefficients multiplied by x/(f). 

Without doubt these investigations, which are as yet in their 
infancy, are of fundamental importance for theoretical chemistry ; 
the non-polar bond is due to the exchange energies. Heisenberg 
has given an explanation of ferro-magnetism with the aid of 
these same principles." 


§ 12. The Symmetry Problem of Quantum Theory 


On taking the spin into account the components of a vector 
x(ut), which represents the state of a single electron, has two 
indices « andz; the first of these refers to the spin and runs from 
1 to v, while the second refers to the translation and runs from 
lton. Actually v = 2 and 2 = oo (as long as we do not restrict 
ourselves to the consideration of quantum states with fixed 
energy). Our vector space ®t is accordingly ,, = Ry, x Rp. 
The state of a system consisting of f electrons is now to be 
represented by a tensor of order f in this space: F(tyty, tgte, 

+, uyty)—a ‘‘ double tensor”? which stands, so to speak, with 
one foot (the Greek indices) in the space R, and the other (the 
Latin indices) in &,. This tensor space is completely reducible, 
with respect to the algebra 2,,, of all symmetric transformations 
of the index pairs (tz), into irreducible invariant sub-spaces, 
each of which ts generated by -.n idempotent symmetry operator. 
The Pauli exclusion principle states that only one of these sub- 
spaces ‘I, 1s physically realized ; it automatically abolishes the 
physically absurd existence of multiplicities which cannot be 
resolved and at the same time denies the existence of absolutely 
non-combining systems of terms. Furthermore, according to 
Pauli this %,, is the space {R/,,} of all anti-symmetric double 
tensors. 

On ignoring the spin perturbation, 8, 1s to be reduced as far 
as possible into sub-spaces §§$ which are invariant with respect 
to the special symmetric transformations of the form 


F (sadn s+ + tig) = Belin s+ ips hy by) + Fluka + ey) (12.1) 


%% 


which do not depend on the Greek indices at all; these constitute 
our old algebra 2’ = 2. This transition from 2,, to 2, 1s to 
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be accomplished in two steps. We first ignore the interaction 
between spin and translation, but allow the translations to 
interact among themselves in an arbitrary manner and similarly 
the spins among themselves; we must then consider only the 
symmetric transformations of the form 


wept yy Ky ttt Ky) ety +s dys Rye Ry). (12.2) 


These transformations do not constitute an algebra themselves, 
but they belong to their ‘ enveloping’ algebra 2, x 2, which 
consists of all transformations whose coefficients 


C(t, aes ees KR) 


are unaltered on subjecting the two rows g++ + ty; Ky °° * Ky 
of Greek indices to the same arbitrary permutation o and the 
two rows of Latin indices to the same arbitrary permutation s. 
The second step then consists in letting y in (12.2) be the identity. 
The first step thus consists merely in making the permutation 
of the Greek indices independent of the permutation of the Latin 
indices, and the second in restricting the first of these permuta- 
tions to the identity. 

In the first place, then, we introduce the elementary sym- 
metry operator o X s which, on applying it to the double tensor 
Fut, * + + tyty), subjects the Greek indices to the permutation 
o and the Latin to the permutation s. The general symmetry 
operator is then an arbitrary linear combination 


a== Sala, s)\(o X S) 
0,8 

of these elementary ones ; we have thus to deal with the algebra 
p X p of elements x, the components x(o, s) of which are functions 
both of whose arguments run through the elements of the group z. 
We denote the element with components F(a, s) == (o X s)F 
by Ff; the equation fF’ = aF (F’ the double tensor obtained 
from F by the operator a) is equivalent to F’ = F.a. The 
group 7 X m of elements o X s contains 7 itself as the sub-group 
consisting of elements s X s. So far as the first step is con- 
cerned, our problem amounts to the following: Let l(s) be the 
components of a primitive idempotent element of the algebra 
t= p= (zm); we set 


I= Sl(s)(s x s) 


and study the elements of the form xl in p x p. They con- 
stitute an invariant sub-space (t x t),; which is to be reduced 


SYMMETRY PROBLEM OF QUANTUM THEORY 349 


into its irreducible invariant constituents; in Pauli’s case we 
have in particular 


] 
I= 77.2°8,(8 x Ss). 
The procedure which it seems natural to follow is first of 
all to express the modulus 1 of p in any two ways as the sum of 
primitive independent idempotent elements : 


l=Je, l= Ze,. (12.3) 
¢ j 
An arbitrary element x of the algebra of p x p is reduced into 
independent constituents in accordance with the equation 
a= Yr(e; x e;)) = Yx,;. (12.4) 
J i,j 
Now we know from § 10, II, that the elements of the form x,, 
constitute an irreducible invariant sub-space p,,;; consider 


xl ae Siig 
1,9 


in this light. The projection x —~ y = xl sends p,; over into 
a certain invariant sub-space (p,,;) of (t x t), Since those 
x of p,; for which xl = 0 constitute an invariant sub-space of 
p,; we have only the two typical possibilities: either (p,;) = 0 
or this projection x -—> xl maps ),; 1n a one-to-one and similar 
manner on (p,,).. The sum 


(Ux Tt), = LPas), (12.5) 


arranged in some particular order, 1s such that each term can, 
in virtue of its irreducibility, only either be contained in the 
sum of the preceding terms or be independent of this sum. On 
retaining only those terms arising from this second possibility, 
(tc < rt), is completely reduced into the sum of certain of the 
(p,;); the representation induced in (t X rt), by the regular 
representation of the group am X mw 1s correspondingly reduced 
into its irreducible constituents of the form h’ x }. It will be 
remembered that this symbol stands for the correspondence 


(o, s) > U'(c) x U(s), (12.6) 


where h’, are the irreducible representations o — U'(a), 
s—> U(s) of aw. This representation h’ X h appears with a 
certain multiplicity d(x’, x) which is determined by the number 
of pairs 77 in (12.5) whose e,’ generate the representation }’ 
and whose e, generate }. These considerations are of course 
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merely a repetition for the case at hand of the proof of theorem 
(6.1). 

We now return to the space of double tensors and consider 
the sub-space %{ defined by those of the form IF. It is the 
substratum of a certain representation (2, x 2',) of 2, xk 24, 
and its complete reduction is given by the formula 


QS, x La) = Ty’, xO) X Oy). (12.7) 


This remains correct even if v or ” is less than f. Earlier in 
this chapter we introduced the right- and left-invariant sub- 
space TY, of r as that sub-space consisting of all elements F which 
correspond to tensors F in the n-dimensional vector space ®,. 
On denoting this tg, which depends on » (and only for x 2 f 


n 
coincides with the entire rt), by t we should consider the algebra 


v n y n 

t X Linstead of r x t. But if e; is in r and e, in t, the manifold 
v n 

of elements x(e, X e,;) is not decreased on restricting x to r X f, 


t 
and every é, (e:) which is equivalent to such an e; (e,) also 
) 


k 
vem 
belongs to x (vr). This shows that (12.7) remains correct under 


v n 

this restriction to t X t; the only effect is that those terms for 
which §, % §, is the 0-dimensional representation are illusory 
We are now ready to take the second step: to perform the 
transition from the algebra 2, x 2, to X = 2, by taking y in 
(12.2) as the identity. We then see immediately that the 
representation &(2') of 2, whose substratum consists of the 
double tensors of & in the sense of equation (12.1), is completely 
reduced into its irreducible constituents §>, corresponding to 
the various primitive characters x of w, in accordance with the 
equation 


UY) = dmx) 5). 


The multiplicity m(y) with which this representation § occurs 
is given by 


mlx) = 2'0(x’, x) N(x’), (12.8) 


where N,(xy) is the dimensionality of the representation Qn, 
and the sum is extended over all the primitive characters ,’ 
of w. Hence on disregarding the spin perturbation we obtain 
the same type of reduction into non-combining systems of 
terms as before, except that the multiplicitv, which was previ- 
ously equal to the dimensionality g of x, is now given by (12.8). 
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(The spin perturbation causes weak inter-system combinations 
to take place and, in addition, resolves each term of the system 
x into its m(y) components. m/(x) is the multiplicity of the 
multiplet structure. Term systems x for which m(y) = 0 do 
not appear at all.) 

Our reciprocity theorem enables us to determine the con- 
stants b. As mentioned before, 7 is contained in 7 X @ as the 
sub-group of elements of the form s xX s; the algebra p = (7) 
appears in p X pas the totality of algebraic elements of the form 
> a(s)(s X Ss). The elements x/ of the algebra p constitute an 
8 


irreducible invariant sub-space p,; let the irreducible repre- 
sentation of a7 which is induced in this sub-space by the regular 
representation be denoted by ), and its character by A(s). The 
space of all elements of the form x/ in p X pis then <p) in the 
notation of §10; it is the substratum of the representation 
<h,> of p X p. <h> contains the representation h’ x fh exactly 
b times; the reciprocity theorem then tells us that the number 
of times the representation h’ x f contains the representation 
h, on restricting 7 X a to its sub-group 7z is also b. Now this 
restriction to m sends the representation (12.6) of a X @ into 
the representation 


(s, s) > U"(s) xX U(s) 


of 7. This means, however, that D(x’, x) 1s the number of times 
the representation lt), of m 1s contained in the representation h’ x h 
of mw (no longer with boldface multiplication sign !). Hence 
b is expressed by 


D(x’, x) = Mix'(s)x(s)A(s~)}- (12.9) 
With this we have carried our solution of the problem of deter- 
mining the multiplicities m(x) as far as is possible in the general 
case. 

Consider in particular the special cases (1) complete symmetry, 
= [R/], and (2) complete anti-symmetry, & = {Jt/}—the 
Pauli case. For the first A(s) = 1. With each irreducible 
representation y 1s associated the contragredient representation 
with character X(s) = x{s“1); if the substratum of the first 
is generated by the idempotent element e the substratum of 
the latter is generated by é@. Or we may describe this situation 
by saying that y and x are the characters of mutually contra- 
gredient representations. (Accidentally x(s~!) = yx(s) for the 
complete symmetric group 7; this does not hold for a general 


permutation group, however, whereas our entire theory does.) 
Equation (12.9) now becomes 


b(x’, x) = Mix’(s)x(s~)}. 
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But in virtue of the orthogonality property ot characters this 
mean value is 1 or 0 according as the representation ¥ is equiv- 
alent to y’ or not. The expression (12.8) for the multiplicity 
then assumes the simple form 


m(x) = N,(X). 


The theorem that the representation ) x contains the identical 
representation s > 1 once or not at all according as ’ is equiv- 
alent to the contragredient of or not is nothing other than 
the fundamental theorem [III, (10.5)] on which the entire 
theorv of representations was based. 

In the second (anti-symmetric) case A(s) = 6,. Now 


x*(s) = 83+ x(s™) 


is the character of the ‘“ dual’’ representation * associated 
with h; if h is generated by the idempotent element e then h* 
is generated by the idempotent e*(s) = 6,:e(s74). Or if 


§:s—> U(s) then §*:s—6,- U(s). The expression for the 
multiplicity is in this case 


m(x) = N,(x*) (12.10) 


If we denote the 1-dimensional representation s— 6, by {1}, 
the fundamental theorem mentioned above tells us immediately 
that h’ x contains the representation {1} once or not at all 
according as f’ is equivalent to h* or not. (12.10) is the actual 
multiplet formula, for this second case 1s the one which 1s of 
interest for atomic physics. 


Additional Remarks. 


The only cases of importance for physics, (1) that of sym- 
metric and (2) that of anti-symmetric double tensors, can be 
handled by elementary methods. We again refrain as long as 
possible from making restrictive assumptions concerning the 
field over which the algebras are defined. The method will be 
illustrated by application to case (1). 

(12.11) If e,, e, are equivalent idempotent elements, then 
é1, és are also. 

Proof. Let p, be mapped on p, by a one-to-one similarity 
correspondence I": x, = %,b; 6b is here the element, of char- 
acter (é€,, é,), into which e, is sent by J’. Let the inverse corres- 
pondence carry é, over into a, which is then of character (és, e,). 
I carries a over into eg; since the element associated with a by 
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I’ is ab we have eg = ab. Similarly, we find with the aid of I 
that e, = ba. We then have 


€, = ab, e:= ba; ee, =a, e,be, = bd. 
Converscly, the existence of these equations guarantees that 
Xo — x45, Xy — XA 


are reciprocal similarity correspondences ), <7 P>.. That is, the 
existence of these four equations means that e, and é, are 
equivalent. We need only to “ roof”’ these equations in order 
to conclude that é, and é are then also equivalent—i.e., go 
over to the quantities % associated with each of these x by the 
definition x(s) = x(s~1). We have here neither assumed that 
the e are primitive nor that the field is algebraically closed. 


(12.12). The invariant sub-spaces ), p generated by e, é are 
the substrata of mutually contragredient representations. 

Proof. let p consist of all elements xe; we introduce in 
addition to this left-invariant sub-space the right-invariant 
sub-space q consisting of all elements of the form ex. Let 
tr (xy) be the trace of the elements x and y, which may vary 
freely in ), q, respectively ; we assert that it is a non-degenerate 
bilinear form. That is: if tr (ay) = 0 identically in q then the 
element a of p must be 0, and if tr(xb) = 0 identically in p the 
element b of q must be 0. Indeed, if zg is any arbitrary element 
whatever and a 1s in $, then 


az = de*2 = a: e2 = ay, 


where y = ez ising. Hence the assumption that tr(ay) = 0 in q 
implies that tr(az) = 0 for arbitrary z, whence a = 0 [cf. § 4]. 
Similarly for the remaining case tr(xb) = 0. 

Now let p and q be referred to arbitrary co-ordinate systems 
and let the co-ordinates of x, y be &,, €,° °°, &3 1, Na ° °°) Ma 
respectively. Then tr(xy) is of the form 


tr (xy) = osu Es Nk. 


The theorem above shows that g SA and h Sg, whence h = g, 
and that the coefficients s;, may be considered as the coefficients 
of a non-singular linear transformation. Hence on choosing 
the co-ordinate system in q in an appropriate manner tr (xy) 
may be reduced to the canonical form 


tr (xy) = Dx? Ne 
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But then 
tr (xy) = tr (yx) = tr (yr7)- rx). 
Hence the simultaneous substitution 
a= re, y= yr, 


which does not lead out of p, q respectively, leaves the trace 
invariant. These two transformations are therefore contra- 
gredient in the new co-ordinate systems; our assertion (12.12) 
then follows immediately on writing the second of these equations 


a» 


in the ‘‘ roofed’ form 9’ = ry and noting that ¥ runs through 


the left-invariant sub-space 5 generated by é as y runs through q. 
After this preliminary skirmish we apply the method em- 
ployed before, somewhat modified, to the case (1) in which 


l = AEG x S). 


We are now interested in the reduction (12.4) only for symmetric 
elements x, 1.e. elements which satisfy the equations 

x(or, sr) = x(a, S) (12.13) 
for ally. This amounts to replacing x by x1; we subsequently 
note that xl(e’ x e) is not symmetric and accordingly multiply 
again on the right by l. We thus replace e’ x e by lL(e’ X e)l 
rather than (e’ x e)/ and proceed to obtain an explicit expression 
for the reduction, rather than calling on the aid of the reciprocity 


theorem. First, the components of J(e’ x e) are (on ignoring 
the factor 1/f!) given by 


Ze'(ra)e(rs) = Zesty ")e'(ra) = ée'(s"}o), 


This expression vanishes if ée’ = 0; for e’ = é we find it 1s 
equal to é(s~10) = e(a4s). This suggests that we choose 


| Dae | 2: 


as the two complete reductions (12.3) of the modulus 1. The 
only terms in the sum (12.4) which then remain for symmetric 
x == xl are those of the form x(é,; x e;), and the factor l(é; x e,) 
is the element with components e,(o7's). Since x(é, X e,;) has 
not been reduced identically to 0 on restricting x to the domain 
of symmetric elements, the sub-space which it generates is 


here, as before, equivalent to the irreducible b, x p; The 
next step consists in multiplying on the right with /, whereby 
e(a- 1s) becomes, in accordance with (8.3) and (7.22), 


l ~-1,-1 = i ~ly) — 1 ~1 
pice’ a sr) = Fxls o) = Z *&(a7 15), 
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Our final result is that any symmetric x can be reduced in ac- 
cordance with 


x == xe’ + xe’ +++ + where e(c, s) = 7 e(o's) > (12.14) 


in deriving this result it is to be remembered that the number 
of times any irreducible representation appears in the regular 
one is given by its dimensionality. 

It follows from the fact that e(s) 1s a class function that these 
elements g’, §’’, + + + constitute a set of independent idempotent 
elements in p X p. This result is in fact obtainable by direct 
methods and is valid, regardless of whether the field in which 
we are operating is algebraically closed or not. To show this 
we note that any ‘‘symmetric’’ element x(o, s) is a function 
only of so™! in virtue of (12.13): x(o, s) = x(so7!). Thus there 
exists a one-to-one correspondence between the symmetric 
elements of p X p—the space of which we denote by [t x t]— 
and the elements of r. Direct computation shows that this 
correspondence associates with each left-invariant sub-space of 
[¢ x t] a left- and right-invariant sub-space of t, and conversely ; 
the reduction of [rt x r] into left-invariant sub-spaces thus 
parallels the reduction of t into sub-spaces which are both left- 
and right-invariant. The whole problem is thus much simpler 
for [r x tr] than for r itself ; its solution is obtained by carrying 
over the equation 


x=xe tre’ t-: - (7.5) 


for the algebra p to [t x rt], the result of which is (12.14). 
Nevertheless we must return to the previous less elementary 
analysis in order to see—and this result presupposes that the 
field is algebraically closed—that each of the irreducible in- 
variant sub-spaces of [t x r] obtained in this way is equivalent 


to a sub-space of the algebra rt x t of the form b x p (where 


p and p are irreducible invariant sub-spaces of t with generating 
units e and @). 

The completely anti-symmetric case can be dealt with in a 
corresponding elementary way. 

The complete reduction of the manifold Rf of tensors in the 
2-dimensional spin space WR,, v = 2, is accomplished with the 
aid of the Clebsch-Gordan formula [III, (5.9)]. (c)f is ©, x ©, x 

- x @, (f factors), where ©, is the representation of the linear 
group C = C, by itself, and by the formula mentioned above this 
representation is completely reducible into the irreducible @,, 
where v can assume only the values f, f— 2, f—4,-°-+. The 
dimensionality of ©, is v-+ 1, and to each of these possible 
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limensionalities there corresponds here but one irreducible 
epresentation. Formula (12.10) then tells us that there exists 
nly one term system having the multiplicity v + 1(= f+ 1, 
— 1, f—3,°°-); compare the beginning of § 15 on this point. 

The preceding analysis seems to me to be necessary in order 
o obtain a complete understanding of the relations implied by 
he permutation group without recourse to the approximation 
haracteristic of the theory of perturbations. So far as the 
atter is concerned we proceed as follows. Again consider a 
erm of the form (8.4) of the unperturbed system, the only 
egeneracy of which ts that necessitated by the equality of 
he f electrons. The perturbation equation is then 


F(ty2y, Be Ge Re tyty) ee > a(st~*) . Fuk, ae, Ay usRz), (12.15) 
t 


there the a(s) are the exchange energies and 1, °-- iy, ky +--+ ky 
re obtained from 1--- f by the permutations s, ¢ respectively. 
et d be the tensor in spin space defined by 


F(yl, bony. Ns, tf) ox P( tabs a by) 5 


he anti-symmetry of the double tensor F then tells us that 


F(uyty, my Lyly) me 3s ° Ss 1h(t, os i ty), 
nd on letting a’(s) = 8, + a(s), (12.15) becomes 
@ = a’d. (12.16) 


‘he problem is thus reduced to that of finding the characteristic 
umbers of this linear correspondence in the 2/-dimensional 
pace Rf. 
Let %,(P) be the characteristic functions of the single electron. 
f the perturbation is due solely to the Coulomb forces between 
he various electrons, that part af the energy matrix a(i, -- ° 1,; 
1 °° * Ry) which is due to the perturbation ts obtained additively 
“om terms of the form 
fc ss Dil Py) + te(Pi) + > + pe Py) | ee 
P.P, 1 f 
rhere « +: 8 and the denominator is the distance between the 
wo points P, and Ps. The orthogonality of the # tells us that 
his integral can be non-vanishing only if the permutation s, 
rhich sends the set of indices & into the set 2 (both of which 
re permutations of 1, 2, °° -, f), is either the identity or the 
ransposition (a8). In this latter case we find 


a(s) = Eup = [feted Pol PP ay ay” 
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On the right-hand side of (12.16) we then have only the terms 
arising from s = I and the transpositions s = (af) : 


§ = {a(l) — 2 EaalaB) }P. (12.17) 


Dirac has given a remarkable formula for the transposition 
acting on a spin tensor. Let ©* be the spin of the «* electron ; 
S*, S§, S¢ are then the operators 


0 lj jo —4 1 0 
1 o| |e op , = 
acting on the a'” index of the tensor f(tyt,- ++ ty). On calculating 


in particular 
(S'S?) = S2S7 + SSy + SESz 


(which should perhaps be written (©! x ©*) instead, since ©! 
affects only the first index and ©? only the second), we find that 
it is the operator 


ty le 


0 1 2};—1 
1 1 


acting on the first two indices, all other places being 0. Hence 
(1 + ©'167)} is the substitution 


(00) > $(00), A(11) > (11); P(10) > (01), G(01) > (10) 
or the transposition of the first two indices. The energy (12.17) 
may then be written in the form 


a 5 aol SS") (12.18) 


This may be interpreted as saying that the coupling between 
: 1 

the electrons « and B is responsible for the term — 5H aa(Sr€*) 

in the energy operator. However, the constant Ey does not 

represent the energy of the unperturbed system.'® 
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C. Expricir ALGEBRAIC CONSTRUCTION 
§13. Young’s Symmetry Operators 


We now supplement the general theory developed above 
by an explicit algebraic construction of the irreducible repre- 
sentations of the symmetric permutation group 7 = 7y,. This 
problem is, as we know, equivalent to that of constructing the 
primitive symmetry classes of tensors of order f by means of 
idempotent symmetry operators e; here a ‘ primitive’’ sym- 
metry class is one such that the symmetry of the tensors be- 
longing to it cannot be further increased by the addition of 
further symmetry conditions—such an additional condition 
either reproduces all the tensors of the class or reduces them all 
to 0. This construction is due to A. Young and G. Frobenius 1° ; 
with its help we are able to verify step by step the entire theory 
of representations of the symmetry group in an explicit and 
elementary manner. 

We are already acquainted with two very simple processes 
which yield tensors of maximum symmetry: ‘‘ symmetrization,” 
by means of which the tensor F yields the completely symmetric 
tensor J’sf, and ‘alternation,’’ which sends F into 36,° sF. 


8 8 
The first of these processes can be readily generalized as follows : 
We divide the range from 1 to » of the ‘‘ variables"? 2,2. + + * ty, 
on which the general tensor component F(1,t). - + + 17) depends 
(or, what amounts to the same, the sub-indices 1, 2, -- -, f), 
into sub-sets of lengths ff, fo,°°°; fA; tfet::: =f. We then 
symmetrize with respect to the indices of each of these sub-sets. 


Een 
j alsa alta 
Ihe dee 
|_| 


Pattern 7, 5, 4, 4, I. 


This distribution into sub-sets may be readily visualized with 
the aid of a ‘‘ pattern’’ P = P(f,, fo, - + *) as illustrated in the 
accompanying figure (for the pattern P(7,'5, 4, 4, 1)]; each of 
the f squares in the pattern is occupied by a different one of the 
f integers 1, 2,---+, f. Each of the sub-sets mentioned above 
constitutes a horizontal row of the pattern, and the various rows 
are arranged one under another. The individual sub-sets may 
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be arranged in order of decreasing length: f, 2 f, 2:°°+°*; the 
pattern then consists of non-interrupted vertical columns as 
well as non-interrupted horizontal rows. Those permutations 
p which permute the members of each row among themselves 
constitute a sub-group (p) of a of order f,!f,!+ +--+ [denoted in § 8 
by m(fi, fo, °° *)]. The symmetry operator described above, and 
which is to be applied to an arbitrary tensor, is 


a= J'p; 
Pp 


henceforth p will always denote an arbitrary permutation which 
sends no numeral of one row into another row. 

So far we have made no use of the process of alternation. 
If after having symmetrized with the aid of the operator a we 
alternate with respect to certain of the variables or sub-indices 
1, 2,:°°, f, we certainly obtain 0 if any two of these numerals 
are in the same row, for the tensor obtained by the symmetriza- 
tion is symmetric with respect to any two such numerals and 
the result of subsequently alternating with respect to them must 
be 0. To avoid this situation we choose one variable in each of 
the rows and alternate with respect to them; since the order 
of the variables in éach row is so far immaterial we may place 
these chosen variables in the first column. We then disregard 
the first column and proceed to alternate with respect to a set of 
variables obtained by selecting one from cach row of the re- 
mainder of the pattern; these variables may now be shifted into 
the second column. This process is continued until we have 
covered the entire pattern; the result is that we have symmetrized 
with respect to the rows and have followed this symmetrization by 
alternation with respect to the columus. Let q denote an arbitrary 
permutation which permutes the variables in each column among 
themselves; these gq constitute a certain sub-group (q) of 7. 
The alternation described above consists in applying the sym- 
metry operator 


b= 35,°q, 
q 


and the entire process consists in applying the resultant operator 


c=ba= 3¥6,:- qp. 
Pq 
We call c the Young symmetry operator bclonging to the 
pattern P. 
In order to obtain a unique symmetry operator c associated 
with a given pattern P we must specify the way in which the 
numerals from 1 to n are to be distributed in P: they shall be 
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introduced in such a way that on reading the pattern, as one 
would read a page of a book, they appear in their natural order 
1,2,---,f. If we write them in any other order, say that ob- 
tained from the standard form with the aid of the permutation r, 
we obtain a ‘‘ conjugate’’ element c, which, as is readily seen 
on considering the relation between the tensors generated by 
these two operators, is related to c by 


C,r=re or ¢,(s) = c(r“}sr). 


Hence the introduction of 7 results merely in a new name. 

From now on we operate with symmetry quantities, 1.e. 
elements of the algebra (7), instead of tensors; we consider the 
invariant sub-space p, of t consisting of all elements of the form 
y == xc and the representation h, of w induced in it by the regular 
representation. With p, is associated the symmetry class , 
of all tensors of the form cF. If we replace ¢ by one of its con- 
jugates c, we obtain instead of p, an equivalent invariant sub- 
space ; in this sense the order in which the variables are written 
in the pattern is quite immaterial. We hope that , is irre- 
ducible and that the totality of representations , associated 
with all possible patterns constitutes a complete set of inequl- 
valent irreducible representations of 7. This hope ts strengthened 
by the fact that the total number of patterns is just equal 
to the number of inequivalent irreducible representations. To 
show this we note that the number of patterns is cqual to the 
number of partitions of f into integral non-negative summands 
f=fitf/e+°:°:.- which satisfy the condition f, 2 fp =- °°. 
On writing 


fi-—fe=n, fe—fs=r, °° 


we see that this number ts equal to the number of solutions of 
the equation 


Ir, + 27g + 3873+ - °° =f 


for non-negative integral ry. But we have already scen that this 
is the number of classes of conjugate elements in w and, by the 
general theory, is therefore equal to the number of inequivalent 
irreducible representations of 7. 

If the dimensionality of the vector space is less than f 
the only non-vanishing symmetry classcs are those arising from 
patterns containing at most m rows, for if the first column is 
longer than alternation with respect to the variables standing 
in it alone causes an arbitrary tensor to go over into 0. The 
only patterns which we need in this case are consequently those 
obtainable from the algebra ro, instead of t, where ty = {9 as 
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defined in §2 above. The number of inequivalent irreducible 
invariant sub-spaces into which the tensor space #/ can be 
reduced is accordingly decreased to the number of partitions 
of f into ” integral summands / =: f; + fe + °° ° + f, for which 
ee = faa 0. 

A permutation s = qp which is obtained by composition 
from a permutation p of (p) and a permutation q of (q) can be 
so obtained in only one way. This is an immediate consequence 
of the remark that the equation gp = J can be fulfilled only by 
p=, a=], for it asserts that p = q™! belongs to (p) as well 
as to (qg). The components of the symmetry operator c can 
therefore be described as follows: c(s) = 9 unless s belongs to 
the set (q)\(p); when s belongs to this set c(s) = +1 according 
as the unique decomposition s=qp yields an even or an add 
permutation q. 

We must now prove the following three assertions con- 
cerning c : 

(1) ¢ 1s essentially idempotent ; or, more precisely, c satisfies 
an cquation cc = y:+c¢, where y 1s a non-vanishing numerical 
factor. Furthermore, y 1s an integral positive number which 
is a factor of f!. Then replacing e by e =: c/y, e is idempotent. 

(2) The sub-spact p, is irreducible, the e introduced in (1) 1s 
primitive. 

(3) Different patterns lead to inequivalent sub-spaces p,. 

The execution of this programme depends upon a simple 
combinatorial auxiliary theorem, which we now proceed to 
develop. Denote the lengths of the columns in the pattern 
P with rows of lengths fi, fe, °° * by ff, fo, --°: 


h2fh,2:° if 2fe 2° Pes, 
ieee aaa ae ae 


-We think of the pattern P as cut out of a rectangular chess- 
board consisting of ff horizontal rows and fj vertical columns, 
and the permutation s as operating on f chess-men occupying 
the f fields. On interchanging rows and columns in P we obtain 
the dual or transposed pattern P*. 

Auxiliary Theorem. A permutation s belongs to (qp) if and 
only tf any two pieces originally in the same row are not sent into 
the same column by s. 

Proof. It is evident that this condition is necessary in 
order that s belong to (qp). The change of position which one 
of the pieces suffers as a result of s can be accomplished in two 
moves, a horizontal and a vertical move (in this order). It 
is at first conceivable that the horizontal move could send the 
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piece into a field of the original board which is not contained in 
the pattern P. If the decomposition s = gp is possible p must 
represent the horizontal move and q the subsequent vertical 
one; it is clear that q and p are thus uniquely determined. 
Now if s satisfies the conditions enunciated in the above theorem 
the horizontal move can never throw them into the same column, 
i.c. the same field. It only remains to show that the horizontal 
move can never send any piece out of the pattern proper, or: 
those pieces which s sends into a column of length f* come from the 
first f* rows of the pattern. We divide the chess-board horizontally 
into an upper and a lower part, the upper consisting of the 
first f* rows. The pieces which are sent into the first column 
by s are, by assumption, from ff different rows; hence there 
are at least (and therefore exactly) ff — f* of them which come 
from the lower part of the board and not from the first /* rows. 
Note that fy — /* is exactly the number of fields in the first 
column which lie in the lower part of the board. On applying 
this argument to each column in succession we find that the 
number of pieces which s sends into those columns which pro- 
trude into the lower part of the board is exactly equal to the 
number of fields in this part of the board. Hence all the pieces 
in the lower part of the pattern are sent into columns whose 
lengths are greater than f*, and the only pieces s sends into a 
column of length f* come from the upper part of the board. 

This auxiliary theorem allows us to assert that if s does not 
belong to (gp) then there exist two pieces in a single row which 
are sent into the same column by s. If u denotes the trans- 
position of the two pieces in their initial positions and v their 
transposition in the final then su = vs; here u belongs to (p) 
and v to (q). 


§ 14. Irreducibility, Linear Independence, Inequival- 
ence, and Completeness 


We now examine the Young symmetry operators c associated 
with the various patterns. Obviously 


c(sp) = c(s), c(qs) = 6, ° c(s), (14.1) 


where p, q are, as usual, elements of (p), (q), respectively.2’ 
Theorem (14.2). Any element a of (a) which satisfies equations 
(14.1): 
a(sp) = a(s), a(qs) = 8, - a(s), (14.3) 


ts a multiple of c. 
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To prove this theorem we first note that (14.3) implies 


a(gp) = 8q° a(l) ; 
1 setting a(l) = A the equation 
a(s) =A <c(s), 


hich is to be proved, is certainly correct for all group elements 
of the form qp. We must next show that a(s) = 0 if s does 
»t belong to the set (qp). Such an s implies that there exist 
anspositions # and v, lying in (p) and (gq) respectively, for 
hich su = vs. But then by (14.3) 


a(su) = a(s), a(us) = 6,° a(s) = — a(s), 


hence a(s) == —- a(s) or a(s) = 0. 

Theorem (14.4). Every element of (a of the form cxc 1s a 
ultiple of c. 

It was shown in the general theory that this theorem is 
alid if ¢ is a primitive idempotent element of (7) and if 
ie field in which we operate is algebraically closed; here we 
»proach it from the opposite direction, as we wish to show 
rectly that it holds for c in order to prove that ¢ is primitive. 
ow obviously any element of the form xc satisfies the first of 
juations (14.3) and any element cx the second; hence any 
ement of the form cxc has both properties and is consequently 
multiple of c. 

Theorem (14.5). cc = ye and y its a positive integer which 
contained 1n f!. 

That cc is a multiple of c follows immediately from the 
revious theorem; y 1s therefore the number 


y = Le(t)c(t) = Nels) + c(s~). 
it’ = | 8 


et the sub-space p, of elements of the form xé be of dimension- 
ity g. The projection 

x—>y=xl (14.6) 
royects any element x into an element lying in this sub-space 
id is, within p, itself, merely the multiplication y = yx. Its 
‘ace 1s therefore yg; to see this we need merely to adapt the 
)-ordinate system in group space to the sub-space »,. On 
1e Other hand its trace is immediately obtainable from (14.6) or 


y(s) = ax{t)e(s- ; 


is f!c(l) = f!, hence 
yg = fi. 


364 THE SYMMETRIC PERMUTATION GROUP 


Consider the meaning of this fact that y is positive, i.e. that 
c(s)c(s~1) is oftener positive than negative ! 

e=c/y is idempotent; hence the character of the repre- 
sentation §, induced in p, by the regular representation is 
by (8.3) 


> 
a 
| 


E Se(r-sr). (14.7) 
Yr 

We obtain as a by-product the fact that the dimensionality g 
of the representation , 1s a factor of f}!. 

Theorem (14.8). $, 1s irreducible. 

We know already that this theorem is a consequence of (14.4), 
but it may be instructive to prove it directly as follows. Let 
e = c/y be reduced into two independent idempotent elements 
é, t+ e,; then 


€é, == €)€ = e,, whence eee = ey. 


Now by theorem (14.4) any element of the form ee,e is a multiple 
of e; hence e, = Ae. ee; = e, then yields the equation A? = dA 
for the number A. Consequently either A= 1 or A=0O, i.e. 
either e, = e or e; = 0. 

We shall say that the pattern P’ with rows of lengths 
ti, fo, °° * is higher than P if the first non-vanishing difference 
fi —fu fe — fa, «++ is positive. 

Theorem (14.9). If the pattern P’ 1s higher than P then 
cco = 0. 

We do not here assume that the variables are written in 
the patterns P, P’ in the normal form agreed upon in the previous 
section—1.e. in which the numerals appear in their natural 
order on reading the pattern as one would a page of a book. 
The proof is based on the fact (I) that there exist two numerals 
which are in the same row in the pattern P’ and in the same 
column in the pattern P. If v is their transposition it belongs 
to the group (p’) associated with the rows of P’ and at the same 
time to the group (q) associated with the columns of P; hence 


c'(sv) = c'(s), cvs) = — e(s), 
On replacing vé in 
c’c(s) = De" (st c(t) =: — Dre’(st)c(vt) (14.10) 
by ¢ alone we find | | 
c’c(s) = — Die(st Ma)e(t) = — Je'(st)c(t) = — c’c(s). (14.11) 


é 
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(F) is evident if the first row of P’ is already longer than the 
first row of P, for it is impossible to distribute the f,’ numerals 
in the first row of P’ over different columns of P if fy < fy. 
If f; =f; and the numerals of the first row of P’ are actually 
distributed over different columns of P, we discard the first 
row of P’ and the f, fields of P containing the same numerals as 
this row. On shifting the fields of P upward to fill in the gaps 
P is transformed into a pattern which has exactly the same 
appearance as if we discarded the first row of S; we are only 
interested in the fact that this process leaves all pieces in their 
original column. The proof can then be completed by mathe- 
matical induction—by assuming that it holds for the abbreviated 
patterns obtained by omitting the first rows of P and P’. 

Theorem (14.12). Let c, c’, - + + be the Young symmetry 
operators associated with different patterns P, P’. +++; the corre- 


sponding sub-spaces P-, Por, ° + + are then linearly independent. 
Let the P, P’, P’, +++ be arranged in such an order that 
P is higher than P’, P’ higher than P”’, +--+. An element x of 


p =p, is reproduced by right-multiplication with c/y but, by 
the previous theorem, this process transforms all elements 
x of p’, x” of p”,-.- into 0. Assume there exists such a linear 
dependence 


Ree pe bee re 0 


on right-multiplication with c we find x = 0 and consequently 
x +x’ +-+++=0. The theorem is thus reduced to the 
same theorem for the smaller set P’, P’, - +--+, and the proof 
follows by mathematical induction. 

Theorem (14.13). Different patterns P, P' give rise to in- 
equivalent sub-spaces Po, Der. 

The proof is accomplished by a direct derivation of the 
orthogonality relations. Let P" be higher than P. Since we 
did not assume in proving theorem (14.9) that the numerals 
were distributed in the same order in the two patterns P and P’, 
we may replace the element ¢ with components c(s) by the 
‘“ conjugate ’’ element c,-1 with components c(rsr7}) : 


yc (st )c(rir™) = 0. 
t 
Summation with respect to r yields 


Dic (st) * Xe(t) == 0. 
t 


On writing x = xe, x’ = xe this formula is equivalent to 


Ex'(s)x(0) = 0. 
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In particular 
Ex(C)x(d) = 0. 


If the two sub-spaces were equivalent we would have y'(t) = x(@), 
and since y(t) = y(t) for the symmetric group the above 
equation would yield 


2x*(s) = 0. 


But this is impossible, for by (14.7) the character y(s) has 
rational components, and in particular x(1) = g + 0. 

This last conclusion is valid only if the number field in which 
we operate is non-modular ; naturally this restriction 1s irrelevant 
for physics. Nevertheless it constitutes a blemish which should 
be removed, for the remainder of our deductions only introduce 
the minimum assumption that f! is not 0 in the field under 
consideration. Now from the general theory we know that 


Theorem (14.14). oD Xx(S)x(s74) = fi. 


The blemish mentioned above is removed by proving this 
theorem directly. We must show that 


axis *) es) = 1 
or 
Dewrs vy *\e(s) == 1, 


r,8 


On replacing the summation variable s by sr, where + is fixed, 
this becomes 


a e(srje(s'r-1) = 1, (14°15) 
Consider next the function 
a(s, s’) = De(srje(s'r7) ; 
as a function of s it satisfies the second condition in (14.3). 


But the first of these conditions is also satisfied, as can be seen 
immediately by replacing 7 in 


a(sp, Ss’) = SLe(spr)e(s'r~) 
by the summation variable p-'r._ Hence by (14.2) 


a(s, s’) = c(s)° Le(rje(s'r™) = ¢(s) + e(s’) = -c(s)c(s’) 
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and therefore the left-hand side of (14:15) or 


dias, ee ; 2ic(s)e{s™) 


is actually equal to 1. 
The relations 
Dx(s)x'(s74) = 0 or f! (14.16) 
8 
show that the primitive characters obtained by our construction 
from the various symmetry patterns are linearly independent, 
and since their number is equal to the number of classes of 
conjugates in the group 7, any class function can be represented 
as a linear combination of the y(s).. In particular, the function 


I(s), which is 1 for s = | and otherwise 0, must possess such 
an expansion : 


f+ 1(s) = myx(s) + m'y'(s) ++ °°. (14.17) 


Multiplying by x(s~!) and summing over s we obtain, with the 
aid of the orthogonality relations (14.16), the equation 


+ ftx(l) = flim 


or 
| m = | (14.18) 


form. Since 


equation (14.17) gives the reduction of the modulus 1 into 
primitive idempotent elements e,. Hence the regular repre- 
sentation is reduced into the irreducible representations h, 
associated with the various symmetry patterns. Since f! 1(s) 
is the character of the regular representation, eq. (14.18) is a 
direct verification of the fact—proved in the general theory—that 
the number of times each irreducible representation appears 
in the regular representation is equal to its dimensionality. 
This completes our direct and elementary development of the 
theory of the representations of the symmetric group. 

The method of proof employed in establishing theorem (14.9), 
1.e. that cc’ = Oif P’ is lower than P, will now be used to answer 
another question. Let a be the operator, introduced in the 
previous section, which symmetrizes with respect to the ciphers 
occupying the rows of P: 


a(s) = 1 or 0 according as s belongs to (p) or not, 
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and let the numerals be written in the pattern P’, which is 
lower than P, in an arbitrary order. I assert that ac’ = 0. 
There exist two numerals which occupy the same row in P 
and the same column in P’. If v is the transposition of these 
two numerals then 


a(sv) = a(s), c'(vs) = — c’(s), 


and the assertion is proved with the aid of (14.10), (14.11) on 
replacing c’, c there by a, c’. Hence also 


do a(st)c'(rir~!) = 0, 
t 


Last )y’(t) = 0 or Sal(r)y’(rs) = 0. 

t r 

That is, the sum of the x’(t) extended over all elements t = rs 
which are left-equivalent to s mod. (p) [i.e. 7 in (p)], is zero. 
In particular, D’y’(s) = 0, where the sum is extended over all 


elements s of (p); x’ 1s the character associated with a pattern 
P’ which 1s lower than P. On applying this result to the con- 
siderations of § 8 (in particular, to (8-13) ff.) we find : 

If the ndividual I has the simple energy levels E,, Ey, +++ the 
term 


fbi + fee +° oe (fi 2fe 2° ty JP jee oo - =f) 


of the unperturbed system I! appears only in those symmetry 
classes of tensors whose pattern P’ = P(fy’, fe’ + + -) is not lower 
than P= PU ijee >): 

Thus we saw in discussing the two-electron problem that 
terms of the form E, + &, appeared in the ‘ anti-symmetric ” 
as well as the ‘‘ symmetric ’’ term systems, whereas terms such 
as 2E, appeared only in the latter. 

Finally, we consider the relations existing between two 
dual patterns P and P* with generators c, c* and characters 
x, x*. The group (p) which permutes the members of each 
row of P among themselves coincides with the group (q*) which 
permutes the members in each column of P* among themselves ; 
similarly (q) = (p*). If s = qp is in (gp), then s-4 = pg = 
g*p* is in (q*p*), and conversely ; for such an element 


C(S) == 0,0 <6" (S20 = 05. 
Hence in general—even when s is not in (gp) and, consequently, 
sl is not in (q*p*)—we have 


c*(s-1) = 8, + c(s). 
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‘Dual ’’ elements c, c* are therefore related to each other in 
exactly the same way as the ‘duals’ introduced in § 12. 
Further 


ys =y; x*(s7t) = x*(s) = 5° x(5)5 g* = 8. 


If P is higher than Q, then conversely P* is lower than Q*. 
For if we lower P by taking away the last field of one of the 
rows of P and adding it to the end of a later (shorter) row, one 
of the columns of P is increased at the expense of a later (shorter) 
column; by such a process of shifting individual fields, in which 
no gap is to occur in a row or a column, P can be transformed 
into the lower pattern Q. 


§ 15. Spin and Valence. Group-theoretic Classification 
of Atomic Spectra 


If the vector space ft = WR, 1s only 2-dimensional, the only 
symmetry patterns P which give rise to primitive symmetry 
classes of tensors of order f are those which consist of at most 
two rows. Let the first row contain /+ v fields and the second 
f; then 


) 


v== f — 2. 


The symmetry pattern P is thus uniquely characterized by the 
number v, which we call its valence, and v may assume any of 
the values f, f — 2, f—4,°-:. Let $B, be the totality of tensors 
of the form cF obtained by applying the Young symmetry 
operator c associated with the pattern P to the totality of tensors 
F, and let §, be the representation of the linear group, the 
substratum of which is the tensor manifold §8,. A sufficiently 
general tensor of order f which 1s symmetric in the first as well 
as the second rows of indices 1s given by 


exerxe-++xexez (t+ terms) 
ye > Ge | ie aa ae | (J terms), 
where 
L== (%1,%2), Y= (Yi, Ve) 


are two arbitrary vectors. On alternating with respect to the 
columns we find that the representation , of the linear group 
¢ = ¢, is that one which is induced on the quantities 


(%1V2— Xeya)' xa (11 + 72 = 2). 


Hence §, is the representation of the linear group which was 
denoted in ITI, § 5, by &. 
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This remark supplies the connection with the symmetry 
problem of quantum mechanics as dealt with in § 12—on apply- 
ing the Pauli exclusion principle when the existence of the spin, 
but not its dynamical effect, 1s taken into account.}8 Since 
the spin space is 2-dimensional, formula (12.10) tells us that 
the only patterns P which give rise to a term system are those 
whose duals P* consist of at most two rows, 1.e. those P which 
themselves have but two columns. If v 1s now the number of 
fields by which the first column of P exceeds the second we call 
v the valence of the term system or of the corresponding state of 
the atom. The multiplicity of the term system with valence 
v is u-+ 1, and to each of these possible multiplicities corre- 
sponds but one term system as we have already seen in § 12 
(in particular p. 356). We previously (Chap. IV) called s = v/2 
the ‘“‘ spin quantum number.”’ 

The fact that the longest column of P cannot exceed the 
dimensionality N of the vector space §, associated with the 
electron translation may result in a further restriction on the 
possible symmetry patterns P. This situation cannot arise 
as long as we deal with the total oo-dimensional system space. 
On the other hand if we restrict ourselves, for example, to those 
states of the electron which are characterized by a fixed principal 
quantum number » and a fixed azimuthal quantum number I 
—and which therefore constitute a (2/ + 1)-dimensional sub- 
space R(nl) within R,—1i.e. if we consider only those states of 
the atom in which all the f electrons outside a closed core are 
in R(nl), the dimensionality N is reduced to 22+ 1. Then f 
cannot exceed 2(2/ -+ 1) and the possible valences of the states 
under consideration are given by the following table : 


fi. O, BA. wheres 4) 4l+1, 4142 
: oO de. @ kuoads 0 i OT 
ij a a a 9 
4 


This table again gives us the alternation law, but shows that in 
addition the number of possibilities decreases from the middle 
of the table on. The possible multiplet numbers 2s + 1 of 
terms in these states is one greater than v. 

This ‘‘valence"’ v, which describes the symmetry state of 
the system, 1s actually the chemical valence, as was shown by 
F. London.® We allow two atoms, consisting of f;, fe electrons 
respectively, to come togethér to form a molecule with f= f, + f, 
electrons. Let $8), $8, be irreducible invariant sub-spaces of 
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the system spaces Rt, Rf respectively. In order to find which 
symmetry states the molecule is capable of assuming when the 
first atom is in the state $8, and the second in $8 we must com- 
pletely reduce the space 8, X 5B, into its irreducible constituents. 
If we consider this decomposition as taking place in the vector 
space of electron spin rather than in that of electron trans- 
lation (the justification for which will be given below), the 
problem is solved by the Clebsch-Gordan series (III, 5.9); it 
tells us that if the valences of the symmetry states of the two 
atoms are v,, UV, the resulting symmetry states of the molecule 
are those with valences 


V == UV, + Ue, Vy + Ve — 2, vy + Vg — 4, °° 3°, lv, — V9. (15.1) 


This situation can be readily visualized in terms of the symmetry 
patterns as follows. Bring the two symmetry patterns P,, P, 
of the two atoms into the positions shown in 
the accompanying diagram and then shove 2 
vertically upwards, one field at a time, until one 
of the two columns of the combined pattern is 
closed; each of these steps represents a possible 
symmetry pattern ¢or the molecule, in which v is 
the number of fields which are not paired hort- 
zontally. The saturation of the valence bonds 
here appears as the pairing of fields or, more physi- 
cally, as the saturation of the spin of an electron 
in one of the atoms with that of an electron in the - 
other. The empirical theory of the valence bond = |—|— 

has therefore a rather profound significance. —|— 

We have yet to justify our use of spin space 

rather than translation space in the above. Let the representa- 
tion of the permutation group zw, corresponding to the two- 
columned symmetry pattern of valence v be denoted by b,; its 
dual )} consists of but two rows. The Clebsch-Gordan series, 
together with the third reciprocity theorem of § 10 as applied to 
the linear group ¢ = Cy, tells us that on restricting 7 to the sub- 
group a’ = 7m, X m, which permutes the electrons of each atom 
separately the representation )} of w contains the irreducible 
representation he, x he, of 7’ once or not at all, according as 
v is one of the values (15.1) or not. From this it follows im- 
mediately that the same result holds for the duals on reducing 
h, after restricting m to mw’. Applying the same reciprocity 
theorem in the opposite direction for the case in which c= ¢, 
is the linear group in m dimensions, we find that the representa- 
tion $,, X 9, of ¢ (or the algebra 2) contains the representation 
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§, once or not at all according as v is one of the valucs (15.1) or 
not. On reducing §,, X §,, into its irreducible constituents 
we may expect to find other representations—which may even 
occur more than once—in addition to these simple §),, but these 
additional representations will correspond to symmetry patterns 
with more than two columns and are, in virtue of the Pauli 
exclusion principle, of no importance for physics. The number 
b introduced in § 11 1s accordingly at most equal to 1 in the case 
of diatomic molecules. 

Molecules which consist of a larger number of atoms can 
be studied by the same method. If in particular we are in- 
terested in the case of three atoms and their valences are v, Us, Us, 
we can determine with the aid of the Clebsch-Gordan series 
the number J, of times the representation ©, occurs in the 
reduction of ©,, x ©, x ©,,. Those v for which b, +0 are 
the valences of the possible symmetry states of the molecule 
and b = b, (which may here be greater than 1) are the corre- 
sponding multiplicities. The characterization of the quantum 
and symmetry states of a molecule which is formed by the 
union of three atoms in given quantum and symmetry states 
requires, in addition to the valence v, a further index which 
distinguishes between the various 0, possible energy levels. 
But this description of the various possibilities differs from the 
empirical theory of the valence bond—the manifold of possible 
bindings 1s smaller.?° 


Classification of Spectral Terms. 


Let the unitary or the complete linear group ¢,, in the system 
space of the single electron be restricted to the group ¢, cy, 
of transformations S, x S,, the two factors of which are trans- 
formations of the spin and translation spaces R,, Rt, respectively : 
R= MR, x R,. The space {R/} of anti-symmetric tensors of 
order f is then reducible into irreducible invariant sub-spaces 
with respect to the algebra of symmetric transformations of 
the form (12.2). We thus obtain a distribution (1) of spectral 
terms among the various symmetry classes; this step is of 
universal validity and is applicable to molecules as well as 
atoms. 

The further classification of terms, as discussed in Chapter IV, 
A, refers to “ simple’”’ rather than “ quantum ”’ states, 1.e. to 
those states which are related to spatial rotation and moment 
of momentum in the same way that the quantum states are 
related to displacement in time and energy. Naturally this 
application of the rotation group D = Dd, (the elements of which 
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we now denote by o, 7, + « +) is significant only for atoms (or 
ions), the nuclei of which are considered as fixed centres of 
force. So long as we concern ourselves only with the electron 
translation and neglect the mutual perturbations of the electrons, 
which are characterized by principal and azimuthal quantum 
numbers » and J, each individual term of the system is char- 
acterized by the quantum numbers (74, 1,3 79, ls | San 1 eh 
The number of times such a term appears in a given symmctry 
system 1s equal to the dimensionality of the linear sub-space 
in which the atomic states under consideration lie. ‘The resolu- 
tion caused by the mutual perturbations parallels the reduction 
of this sub-space into its irreducible constituents Ry with respect 
to the group D0 of rotations; the resulting components of the 
term have the natural multiplicities 22 + 1. The spin space is 
similarly to be reduced. Let Dd induce the representations 
h,:¢0—> U(o) and ©: ¢ > V(o) in R, and WM, respectively. This 
second step (II), in which the spin and translation spaces are con- 
sidered separately, is interpreted from the stand-point of group 
theory as meaning that we associate with the element (o, 7) 
of DX D the transformation U(o) x I(r); we thus obtain a 
6-parameter sub-group of ¢, X ¢,, and on restricting ¢, X C, to 
this sub-group our original irreducible sub-space is further 
completely reducible into irreducible constituents. The trre- 
ducible representation of D x D induced in such a sub-space is 
of the type, & Q,. The final step, (III), consists in introducing 
the coupling o=7: the 6-parameter sub-group ts_ thereby 
restricted to a 3-parameter sub-group, 1.e. that sub-group 
induced in the total system space by the rotations 0. The 
spin perturbation then resolves cach such term multiplet into 
its (at most 2s + 1) components: 


DX OD, =20, G=lt+s,l+s—1,--+, [[—sh); 
J 


naturally D, x ®, is here a representation of D instead of D X D. 

Actually v= 2, and the transformations induced 1n_ the 
spin space Jt, by the rotation group constitute the unitary group 
in two dimensions. Consequently the transition from ¢, to }, 
in step (II) involves no reduction in spin space—this 1s the 
essential simplification caused by the fact that , has so small 
a dimensionality. 

To the symmetry system of terms corresponds a certain 
irreducible representation of the unitary group u in the space 
KR, of the electron translation and with it a certain irreducible 
characteristic (§ 9) 


X a X(é,, Ea, carl :). 
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The co-ordinates x; in the space t, are broken up into classes 
in the manner described in Chapter IV, § 1: 


x“ (m) mah bed, * 2 eed 
x'(m') [mM =e —1,-+-, —0]; 

Each of these classes describes a (2/ + 1)-dimensional sub-space 
Ri(nl) of R, in which the group 0, of spatial rotations induces 
the irreducible representation 2, and is characterized by the 
principal quantum number 7 and the azimuthal quantum number 
/, The arguments e, of X are correspondingly broken up into 
classes. To give the principal and azimuthal quantum numbers 
of the individual electrons—without stating how these numbers 
are distributed among the f electrons—we need only to state 
how many (f’) electrons are represented by states in each of 
the various sub-spaces f’ = R(nl). If, for example, 3 of the 
electrons are in §’ and the remaining 5 in Rt” (f = 8) we must 
separate out that part of X which is of degree 3 in the variables 
e, belonging to ’ and of degree 5 in those belonging to R”. 
The multiplicity M of the corresponding term 


E(myly) + E(nele) ++ + + + E(nyl;) 


of the “unperturbed ’”’ atom in the symmetry system under 
consideration is then obtained from the part of X described 
above by setting all ¢ contained in it equal to unity. In order 
to determine how this M-fold term is broken up on taking the 
mutual influence of the electrons into account we replace the 
variables e(m) of the class (nl) by e(m) = e™, the variables 
e’(m’) of the class R(n’e’) by e’(m') = e™ (with the same 6), etc. 
The resulting expression must be a linear combination of the 
sums 


with non-negative integral coefficients. This enables us to 
tell which of the various total azimuthal quantum numbers L 
appear, and how often, in the resolution of the above term ; 
each such L-term has still the multiplicity 22 + 1. 

Example. We consider, as an example, the case in which 
f = 3 and all three electrons are in the same sub-space (ml). 
The possible symmetry patterns are 


-- as sem 
PJ 
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The Pauli exclusion principle allows only the first two; their 
valences are v = 3 and v = 1, and the corresponding terms are 
therefore quadruplets and doublets, respectively. The first 
pattern defines the anti-symmetric tensors of order 3 and the 
third the symmetric tensors. The corresponding characteristics 
are therefore 


X= 5, = LE, 8; 8%, Xy = A EE Ex. 


t<j<k twSjI2 
On introducing 
2 3 
Sg = O8; Ej, $3 >= g; 
t+) 


we have X, = S$; + So -+ 53. The dimensionalities of the re- 
presentations of a3 corresponding to these three patterns, and 
therefore the numbers of times the representations X,, Xe, X; 
of ¢ appear in (c)’, are easily shown to be 1, 2, 1, (in accordance 
with the equation 3! = 1* -+ 22+ 1%). Now the characteristic 
of the representation (c)® of ¢ is 


ty = (Le,)? = 53 + 352 + 65, ; (15.2) 
the equation é 
ty = X, + 2X_ + Xz = (25, + Sg + 53) + 2X, 
then allows us to conclude that 
Xo == Sp + 25). 


We prefer to carry out the evaluation with the aid of the sums 
of powers 


2 a 3. 
ey, fo oF PZ : 281, ts nad D8; ) 
t t t 
we then have 


fo = Sz -+ Sg, tg = Sg 


in addition to (15.2). Consequently the characteristics in 
which we are interested are : 


Doublets : Aa ah ~- ts), (15.3) 


Quadruplets: X, = 5| g(t — ts) — (ty — ts) |. (15.4) 


The solution of the problem discussed above is now obtained 
by replacing the 2/ + 1 variables €; by the set 


et ae e e a ew 
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and then expressing ¢,, fa, ff as a sum 2a,(l.) of expressions of 
the form 


(L): e@+ eh 14. ;: -t el 


with integral multiplicities az. The computation Is considerably 
simplified by multiplying both sides of the equation by ¢ 4, 
as (L) then becomes €”'! — e~4, The multiplicities so obtained 
are given in the following tables : 


ty L == 3, 81—1,3i—2,++-,¢[L=0,1,2,-° +1 
Multiplicity : ia ae nr . Agee ee 
(increasing by 1 each step) (increasing by 2 
each step). 
| ty | L = 31, 31-1, 31 — 2, 81--3,- + d 
Multiplicity : 1, 0, | Oe Sepa 


’ 
(alternately 1 and o) 


L=1i—1,1—2,1-3,- ++, 0 


a ee a i dh 


L293), fo 2h 
(alternately 1 and — 1) 
rea | = 3, 3 — 1, 31 — 2, 31 — 3, 31 - 4, 1-5, 
Multiplicity . 1, —1, 0, 1, —- |, 0, 


(repetition with pericd 3) 


On applying these results to the computation of X,, X, with the 
aid of (15.3) and (15.4) we find that the number of terms with 
total azimuthal quantum number L is as given in the following 
tables : 


Doublet System 
L = 0, 1, A 3, 4, Q, o 8 
(1) se BM i Sala 


O12{23 4 ]|--- 

up to L= 1. The period is here 3; the multiplicities in the 
second period are those of the first increased by 2, those in the 
third are obtained from those in the second by adding 2) etc. 


a) £= 3) 3—1, 31-2 | W—-3, 2-4, 31-5, | -- 
Ee ee 


down toL = 1. The periodicity is again 3, but the multiplicities 
in each period are obtained from those in the previous one by 
adding 1 instead of 2. 

Quadruplet System. The periodicity is here 6 instead of 3 

(1) For the values of L. from 0 to U the first period of multi- 
plicities (L = 0, 1, 2, 3, 4, 5) is for even 1: 010212 and for 
oddZ: 101121. The multiplicities increase by 2 from period 
to period. 
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(2) For values of L from 31 down to / the first period ts 
)} 00101 regardless of whether L is odd or even, and the 
multiplicities are increased by 1 from period to period. 


§ 16. Determination of the Primitive Characters of 
uand a 


The guiding principle in the whole of the present chapter 
is the reciprocity between the symmetric permutation group ay, 
and the algebra 2 of symmetric transformations. But this 
latter can, as was shown in §1, be replaced by the special 
symmetric transformations induced in tensor space by the linear 
transformations of vector space and which constitute a group 
‘c)f isomorphic with the linear group c. Indeed, we may even 
restrict ¢ to the unitary group u. The algebra 2 is thereby 
referred to a group—not to a finite group, it 1s true, but to a 
closed continuous group. Now we have seen in Chapter III 
that we may expect such groups to behave in a manner entirely 
analogous to that met in dealing with finite groups, at least 
if we concern ourselves only with unitary representations. As 
a rule we find in mathematics that the continuum is more easily 
handled than a discrete manifold; the formula (9.11), which 
expresses the fundamental reciprocity mentioned above, will 
therefore better serve to compute xy from X than the converse. 

We therefore next evaluate the characteristics X of the 
continuous irreducible unitary representations of the -dimen- 
sional unitary group u by a direct method which is independent 
of our previous development. The case n =: 1 has already been 
solved in III, §8; the procedure there developed serves as 
a inodel for the present case. With this in mind we first prove 
the following auxiliary theorem : 

A continuous function f(w,, @s, * * +, w,) of absolute value 
1 which possesses the period 27 in each of the 1 real arguments 
and which satisfies the functional equation 


f((@ + @')) = f((o))f((o")) 
is necessarily of the form 
fi(w)) = e(hyw, + hyw, 4-- + + + hyw,), 


where the constants # are integers. 
On introducing the 2 functions 


filw) = f(w, 0,0, °° +, 0), folw) = f(0, w, 0,° + +, 0),-- > 
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of one variable, we are able to conclude from the functional 
equation above that 


f(wy, We, °° >) = f,(w,)fe(we) t,t. 


It therefore suffices to prove the theorem for functions f(w) of 
one variable, and this we have already done [II], § 8]. 

Every element S of the group u is conjugate to a “ principal ”’ 
element E, i.e. to a transformation of the form 


ky > byx%y (v= 1,2,+ ++, nN). (16.1) 


The numbers ¢, are of unit modulus and may therefore be ex- 
pressed as 
tw 


ee = 2 "= elo,) 


‘ ? 


in terms of the ‘“‘ angles of rotation ’’ w,, we, * + *, w, (which are 
only determined mod. 27) of the unitary transformation S. 
In order to employ the orthogonality relations it 1s necessary 
to determine the volume dS of that portion of the group mani- 
fold u whose elements have angles between w, and w, + day. 
Q1, Ao, °* *, a, being any m numbers, Ict D(a, ao, > + +, a,) denote 
the product ; 

II (a; — a,) = ae, Ree A. 1 | 

t<k 
of differences; the ” rows of the determinant on the right are 
obtained by replacing a successively by a,, ag, °° *, @,. The 
evaluation of the volume clement @S will be carried out in the 
following section ; we here anticipate the result 


dS = AAdw,dw, +++ dw,, A= D(é, &, °° +, &n). (16.2) 


The determination of the primitive characteristics of u is 
accomplished by combining the following important facts.*} 

1. Symmetry.—Each element S of u is conjugate to a prin- 
cipal element F, (16.1). Hence it suffices to determine the 
characteristic X of a continuous representation of u for such 
a principal element. / goes over into a conjugate transforma- 
tion within u on permuting the ¢,: hence X 1s a continuous 
symmetric function of the angles w, and is of period 2m in each 
of them. 

2. Arithmetic Properties —The principal elements constitute 
an Abelian sub-group of u; on compounding two such elements 
E, E' the angles w,, w, are added. The normal co-ordinates 
Vy, IN representation space $ can therefore be chosen in such a 
way that the principal elements correspond to principal trans- 
formations 


Et Ve PeVe 
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indeed, we have shown in I, § 5, that any commutative system 
of unitary correspondences can be brought simultaneously into 
diagonal form. On compounding two principal elements the 
condition that E be a representation is expressed by the functional 
equation, 


p(w, We, * * )p(w4, Ws, a *) = p(w, — Ws; Wo + Wy, sain -) 


for each of the multipliers p = p,. The auxiliary theorem then 
tells us that each p; is of the form 


e(hyw, =i i ae hn), 


where the constants A are integers. The characteristic of the 
representation is the sum of these p,; hence X 1s a finite Fourier 
series in the arguments w with integral non-negative coefficients. 
The ‘‘ weights’’ of a representation are the sets of exponents 
(hy, he, + + +, hy) of each term 


e(hyw, + how. + $ ae: 2% of h,Wn) == ght ehs oS ies te gin 


which actually appears in X. The term (Ay, hg, ++ +, A) 1s said 
to be ‘‘ higher”? than (hj, Ag,- °°, #,) if the first non-vanishing 
difference h, — hy, %, — ho, - * + is positive. 

3. Orthogonality.—For all primitive characteristics X the 


integral 
2n 2n 


es [XXAA de, -+ + dw, 
0 0 
must have the value 


Fock ven tiated (16.3) 
0 0 


These orthogonality relations suggest that we introduce the 
quantities € = A-X in place of the characteristics X; they 
are also finite Fourier series, but they are anti-symmetric functions 
of the angles w instead of symmetric ones. Ay, ko, + °°, h, being 
integers arranged in decreasing order 


hy>he>s > Mp, (16.4) 


we construct the ‘‘ elemental sum ”’ 
E(hy, ho, aon h,) ae a ae e(hyw, + hows + as + h,w,), (16.5) 


i.e. the alternating sum over the permutations of the arguments 
w; the term which we have written down is the highest one 
inthe sum. Every alternating Fourier series is a linear aggregate 
of such elemental sums; since the coefficients of these sums are 
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integers, and in particular that of the ‘‘ highest’’ term is 1, 
every alternating Fourier series, such as €, with integral co- 
efficients can be expressed as a linear aggregate of the form 


E=c-&(hy, hg, ° ' Yes Eh, hy, ° ° “eas “ie (16.6) 
with integral coefficients c, c’, + + +. Let this expansion be 
arranged in decreasing order, 1.e. in such a way that the set 
(hy, Me, * - -) of exponents is higher than (hy, hy, - - -), etc.; 
hy, ha, +++) 1s then the highest term in €. A ts itself an elemental 
sum, namely 

A= &n—1,n—2,:---, 1, 0). 
Hence if the highest term in X has exponents fi, fo, °°, we have 
h=f, + (n— )), my hy = fai +1, hy = fn; (16.7) 
in the following the numbers f; and h,; are always in the relation 
(16.7) with one another. 
We denote integration with respect to all the angles of 


rotation from 0 to 27 by a single integral sign and write dw 
for dw,dw,:+:dw,. We now calculate 


felts, he, > JEM, hb, + + *)dw ; 


the A and the hk’ are arranged in decreasing order in accordance 
with (16.4). Consequently no permutation of the h can coincide 
with a permutation of the h’ unless 


hy oe hy, hy =e ho, my hy os hi; (16.8) 


the integral of cach of the (m! )* terms in the product 


E(hy, Me, +++) Elta, Ma, + + *) 


is therefore 0 unless (16.8) holds. In this latter case those 2! 
terms, for which the permutation of the h is the same as that 
of the A’, each contribute (27)" to the integral and all others 
contribute 0; hence 


J Ella, he, + JER, yy + \dw = f (2zr)” 


according as (16.8) holds or not. Applying this in particular 
to the clemental sum A, we find 
JA Adw = V = n!} (2a)". 
On setting the expansion (16.6) in the equation 


| dw = V 
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we find Jc|? +- jo"? + -++=], Since the ¢, c’, ++ * are non- 
vanishing integers only the first term can appear in (16.6), and 
we must have c == 1 or -- 1, and since the coefficient of the 
highest term of € (as of X) must be positive we are restricted to 
the first alternative c= 1. We have thus shown that every 
primitive characteristic 1s of the form 


X on E(hy, hy, a 2) ae |e", e”2, my ghn | 
A je" ° e ° e, I 


(16.9) 


’ 


where the h,; are integers arranged in decreasing order: hy >h,>-+--. 
The function defined by (16.9) is a finite Fourier series with 
the highest term (fi, fo, ° °°, fn); the coefficient of this term, its 
multiplicity, 1s 1. 

4. Completeness.—The last question to be answered asks 
whether every function of the form (16.9) is conversely the 
characteristic of some irreducible representation of u or not. 
Our explicit algebraic construction allows us to answer this 
question in the affirmative. To show this we first note that the 
representation of order f arising from the symmetry pattern 
with (at most 2) rews of lengths fy, fo, - - +, fy, has as highest 
weight (fj, fo, ° °°; fn); this can be seen immediately by con- 
sidering the representation as generated by alternation from 
the product of # vectors, the first of which occurs f, times as 
a factor, the second fy, etc. (as in the simple case at the beginning 
of § 15). The f are here any integers satisfying the conditions 


fh2fe2°+:'2fn 29. 


On dividing the transformation corresponding to the arbitrary 
element S of u in this representation by the /” power of the 
determinant of S (/ being any fixed non-negative integer) the 
highest weight of the resulting transformation is (f, — J, 
fe—l,+++, fp —l); this simple device thus enables us to dis- 
pense with the restriction f, 20. We have thus proved that 
all irreducible unitary representations of the unitary group Uy 
are oblainable by completely reducing the representations (u)f for 
f= 0,1, 2,° ++ tnto their irreducible constituents and multiplying 
by the 1-dimensional representations 


S—> (det. S!' [J = 0, 1, £2,-° ¢]. 


We have further shown that the characteristic of the irreducible 
representation ) == D(fi, fe, °° +, fn) of order f of u, which 1s gener- 
ated by the symmetry pattern P(f,, fo, ° + *, fn), 18 given by equation 
(16.9). 
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We could also have obtained this last result with the more 
transcendental method of proof employed in steps 1 to 3. If 
we are operating in the continuum of all complex numbers 
rather than an arbitrary field the proof of the completeness of 
the irreducible representations of a finite group can be formulated 
in such a way that it can be taken over immediately for the case 
of a closed continuous group with the aid of the theory of integral 
equations. The particular application of this general group- 
theoretic completeness theorem to the group Dd, of rotations of 
a circle into itself yields the completeness of the Fourier orthog- 
onal system e*™ (m= 0, +1, +2,--:-). Its application to 
the closed group u, yields the following two facts: (1) Every 
expression of the form (16.9) is in fact a primitive characteristic. 
For if it were not it would be a non-vanishing function of position 
on the group manifold—in fact, a class function-——whose Fourier 
coefficient with respect to each irreducible representation 
vanishes ; it is indeed orthogonal to all other functions of the 
form (16.9). (2) We further find that the functions (16.9) 
constitute a complete set of orthogonal functions for symmetric 
periodic functions of w,, We, ***, @,; this result is of no particular 
interest, as it 1s a consequence of the completeness of Fourier’s 
orthogonal system in one dimension. Our general considerations 
(1) to (4) yielded so many properties of primitive characteristics 
that we were able to obtain an explicit expression for them from 
these properties alone. 

Consequences.—The assumption that h, == f, 2 0 constitutes 
no actual restriction; the characteristic is then a symmetric 
rational integral function of the ¢ of order f. The ¢ are in fact 
roots of the characteristic polynomial f(r) = det (71 — S) of 
the unitary transformation S; it is therefore possible to express 
X rationally and integrally in terms of the coefficients of this 
polynomial, and therefore in terms of the coefficients of the 
matrix S. The restriction to the unitary group can then readily 
be removed, but we shall not go further into these considerations 
here.*# 

The dimensionality of the representation X is found by 
calculating X for the unit element, all of whose characteristic 
numbers ¢, are 1. On substituting directly in (16.9) we obtain 
the indeterminate form 0/0, so we proceed as follows. Take 


w, = (n — l)w, we = (n — 2)w,° + +, Wy = Ow 
in terms of the single anglew. The determinant in the numerator 


of (16.9) is then the alternating sum of the terms obtained from 
the product 


e(hy(% — 1)w) + e(h,(n -- 2)w) - + + e(h,0w) 
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by permutations of the numbers nm — 1, n — 2,-+-, 0; it is 
therefore equal to 


[{e(hw)}r¥, + + +, fe(hen)}, 1 


or to the product of the differences of the expressions e(h,w), 
e(hyw), * + * obtained by subtracting any member of the set from 
any of the earlier members. On allowing w — 0 we have 


e(hyw) —- e(hyw) ~ iw(h, — hy). 


The dimensionality N of the representation denoted by 
(fi, fo, °° *, fx) in the above is consequently 


mee D(h,, hs, as h,) 

~ Din—1,--+ +, 1, 0)} ae 

Evaluation of the Characters of m;—Having obtained explicit 
expressions for the characteristics of the representations of U,, 
we now employ the connection between the representations of 
a, and u,, developed in § 9 to evaluate the primitive characters 
of mw, In equation (9.12) y is the character and X the char- 
acteristic of the irreducible representations of my and Uy, re- 
spectively, generated by the symmetry pattern P(f,, fo, - - *); 
in particular we must put X = O1f the pattern has more than 
n rows. The sum is extended over all possible symmetry 
patterns P with f fields. The expression (16.9) for X then allows 
us to enunciate the following rule for the evaluation of y: Let 


Xfifserc: CE _— *) (16.11) 


denote the value of the character of the irreducible representation 
b(fi, fe, °° *) of 2, which is generated by the symmetry pattern 
P(fi, fo, °* *), for an element s belonging to the class f = (2,24: - -). 
Choose an arbitrary positive integer n and construct the sums 
G1, Oo, °° * of powers of n independent variables &, €2,°* *, E, and 
the product D(€,, €s, °° *, &n) of their differences. The term (16.11) 
is then the coefficient of the term eres. -- eh [h, = f; + (n — 2)] 
in the expansion of 


D(€,, &,° * *, En) *opou#  tt. (16.12) 


We here assume that the pattern P has at most ” rows; hence 
if we wish to obtain all primitive characters of a, we must choose 
n =f. The rule shows that the components of the characters 
are integers. 

This result was obtained by Frobenius in a purely algebraic 
Manner, without introducing the continuous group u.%8 But 
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I believe that the real reason for the rule comes to light only 
when we consider this connection between the groups a, and 
U,—in particular, it enables us to understand why a second 
integer 7 in addition to f is involved. 

The dimensionality g of B(fi, fe, +++) is ope 1) substitut- 


ing the argument s=I: 4=/f, 2=7%,=: == 0 in the 
character y. Formula (9.12) is then 

oa, = Lex, 
where the sum 1s extended over all patterns P(f,, fo,°°°). Since 


o, 1s the characteristic of the m-dimensional representation 
c: S-—S of the group u by itself, this merely means that in 
the complete reduction of (c)/ the irreducible representation 
© = H(f1, fo, ° + *) appears exactly g times, as we already know. 
On substituting the explicit expression (16.9) for X we obtain 


of - er reg l| — Ye: le”, gh Foe eM 


g is accordingly equal to the coefficient of eheht- ++ in the 
expansion of the product on the left-hand side. The term 
+ ehiekz . - + cm in the expansion of the determinant must 
be multiplied by the term : 


f! hy— 


—ky phy—k 
Si Sea Me et Ey 1 E5 2 Be eo 
(h, aa Ry) (he aa ko) | i 
of of in order to obtain a contribution to the term eek --- 
of the product. (R,, ke, + + +, &,) here run through the per- 
mutations of 2 — 1,--.-, 1, 0 and g is accordingly equal to the 


alternating sum 


over these permutations, or equal to the determinant 
| oe aoe, 
mee "h—D! A}, 


=a | (k= 1) - + (h—-n+ 2),- ++, A, 1. 


The rows of this determinant consist, on reading from right to 
left, of polynomials in h of degrees 0, 1,- +--+, (x — 1) with highest 
coefiicient 1. The determinant is therefore 


\an-}, see h, 1| 
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and we finally obtain the simple formula 


pcre i i ee 
aa he ea ee 6.19) 


n is to be taken at least as large as the number of rows in the 
pattern P(fi, fo, ° °°); the reader should convince himself by 
direct calculation that the value of (16.13) remains unchanged 
on replacing 2 by » + 1. 

Frobenius’ rule for the character and this formula for the 
dimensionality are vastly superior to (14.7) for purposes of 
practical evaluation. 

As an example, we carry through the computations for the 
case of four electrons; the results are given in the table below. 
The group 7, contains twenty-four elements which are divided 
into five classes of conjugates ; each of these classes is designated 
in the second column of the table by the values (7,7, + + +) as- 
sociated with it. The first column contains the number of 
elements in each of these classes, and the sign + or — indicates 
whether the class consists of even or odd permutations. Fach 
of the five remaining columns contains the valucs of a primitive 
character for the classes in whose row they stand. The symmetry 
pattern to which each of these characters belongs ts indicated at 
the head of the column by the numbers f,, fo, ° + + of elements in 
its rows. ‘The first and the last of these columns may be filled in 
immediately, and the second and third with the aid of Frobenius’ 
rule. The fourth is then obtained from the second on noting 
that its symmetry pattern is the dual of that of 2; we need 
then merely to replace the values in the second column by their 
negative for the (-)-classes. Since patterns 2 and 3 contain 
but two rows we may take n = 2. Hence on writing x, y in 
place of €,, €2 we have merely to find the coefficients of xy (for 
the column 31) and xy? (for the column 22) in the following 
polynomials : 


(% —- y)(x? + y?*)?, 
(% — Me + Na + 98) = (x8 — y)(x8 + y?), 
(x — y)(x* + y*). 
The dimensionalities of the five irreducible representations are 


contained in the first row; they are 1, 3, 2, 3,1. The verification 
of the orthogonality relations is left to the reader. 
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“Pattern. 
No. bs 4 
Elements. 


Class. \_ 


Le 
6— 


3-4 


8 + 
A 


§17. Calculation of Volume on 1 


Consider the line elements going out from the unit point J 
on the group manifold u, 1.e. the infinitesimal unitary trans- 
formations 5S = |[dsa,i|. We may take as the real components 


en | 
of this ‘ vector’’ the n quantities a OS, and the real and 


imaginary parts of the n(n — 1)/2 quantities dsag(a<B); the 
total number of components is thus n?, which is therefore the 
dimensionality of the group manifold u. Now in a linear algebra 
of this kind we may replace any two real quantities a, b by the 
complex quantities a+ 1b, — a+b obtained from them by 
a simple linear substitution; we may therefore replace the 
real and imaginary parts of 85ag(a<8) by 8sag itself and 
or! OSag = OSgx. 

On transporting such an infinitesimal vector to the point 
S on the group manifold by a left-translation its terminus goes 
into the point $+ dS = S(1+ 8S), dS=S-8S; we must 
therefore consider the infinitesimal element 8S = S~dS as the 
“vector’’ which leads from S to S+ dS. Our definition of 
volume on the group manifold [III, §12] consisted in the 
following: the parallelepiped defined by n? vectors 5S leading 
from the fixed point S to the neighbouring points S + dS has 
as volume the absolute value of the determinant formed from the 
components of the n? vectors 8S. In accordance with the above 
remarks we may take as components of the vector 6S = ||dsag|| 
the totality of coefficients dsag themselves. 

Any S can be expressed in the form 


S =: UEU-} (17.1) 


where £ is a principal (diagonal) element of u and U is unitary. 
S is unchanged on multiplying U on the right by any principal 
element. We employ a geometrical terminology which will 
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allow us to visualize our procedure by means of an analogy. 
Two elements U, U’ of u which are right-equivalent with respect 
to the group of principal elements: U' = UE, will be said to 
‘lie on the same vertical [U].’’ From the n?-dimensional mani- 
fold u we obtain by projection the (n? — n)-dimensional mani- 
fold [ut] of verticals [U] on considering all points of u which 
belong to the same vertical to be coincident. This process of 
identifying equivalent elements was described in general in the 
beginning of Chapter III—we had, in fact, already met it in I, 
§ 1, in the special case of projection in affine space. We may now 
consider U in (17.1) merely as a representative element of the 
vertical [U]; on allowing [U] to run through the entire mani- 
fold ({u] and the angles w, of E: 


| e(a) 
e(we) 


e(w,) 


to vary independently over the complete range 0 Sw < 2z 
the element S defined by (17.1) describes the manifold u exactly 
n! times. 

The vector 5U = U~'dU leads from the point U of the vertical 
[U] to the neighbouring point U + dU of the vertical [U + dU]. 
The totality of all points on [U + dU] which are in the neigh- 
bourhood of U is given by expressions of the form 


(U + dU)\(1 + 8E) = U+ (dU + U 8E) 


where 8£ is an arbitrary infinitesimal principal element with 
coefficients 2 6w, on the principal diagonal; the corresponding 
vectors are 5U =8U + 8E. Since the terms in the, principal 
diagonal of 5U are pure imaginary, E may be uniquely deter- 
mined in such a way that all terms in the principal diagonal of 
dU vanish; we call this transition from [U] to (U + dU] the 
‘* horizontal transition from U.’’—The transition from some other 
point UE of the vertical [U] to the point (U + dU)E of [U + dU] 
is accomplished by means of the vector 


8'U = E-)-8U- E. (17.2) 


That this linear transformation (17.2) determined by E, which 
sends 6U into 8’U, is unimodular follows from our general re- 
marks concerning closed continuous groups—and can in this 
case be readily verified by direct computation. Naturally this 
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same equation holds for the horizontal transitions 8U, 8’'U from 
U, UE respectively: | ; 
VU = E-86U- E. (17.3) 


n? — n horizontal vectors 6U leading out from U determine an 
infinitesimal “ parallelogram ’’ whose content is measured by 
the absolute value of the determinant of the 22 — 2 components 
Suag (a += 8) of the various vectors 8U. On allowing each point 
U on the periphery of the parallelogram to describe the vertical 
{U] we obtain a tube whose horizontal sections are parallelo- 
grams; its projection on [u] is the original clement of volume, 
the ‘‘ parallelogram ”’ defined by the 6U. Since the linear 
transformation (17.3), dU + 6’U, is unimodular, the content of 
each horizontal section is the same, and may therefore be con- 
sidered as the content of the volume element on [u}. 

We now examine the variations in [U] and £ in (17.1) when 
S goes over into S+ dS. We have 


SU = UE 
and therefore 
adS:-U+S:dU=dU-E+U-dael. 


On multiplying both sides of this equation by U~!S7! -= [#-1U7! 
we find 

U7-8S-U--su=f'-6U-E+ 6E 
or 


8S = U-8S-U = {E)-8U- E — §U} + 8K. (17.4) 


The components of the matrix contained in parentheses are 


Ss 
Bilan  c == i) 


We now define a parallelepiped at S which shall serve as a 
volume element in the following manner: nn? — 7 of the 2? 
sides 5S are obtained from (17.4) on allowing the angles of 
rotation to remain fixed, i.c. 6/2 =: 0, and drawing n? — 2 hori- 
zontal vectors 6U from the point U to form a volume element 
of magnitude d[U] on [u]; the remaining 7 vectors 6S are then 
chosen in such a way that for each of them one and only one of 
the angles w, changes by dw, and [Uj] remains unchanged. The 
corresponding 7? vectors 6’S define, in accordance with (17.4), 
an element of volume of magnitude 


TU (S = 1) + a(0}- deny deoy + + + deve (ITS 


a 
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Since the linear transformation 6S > 6’S = U~1+8S° U is uni- 
modular this volume is equal to that of the element defined by 
the 5S themselves. Since € = l/e the product II in (17.5) can 
be written 


The final result is: The volume element described by S on allowing 
[U] 1m (17.1) to describe an infinitesimal volume element of mag- 
nitude d[U] on |u| and on allowing the angles of rotation w, to vary 
by dw, has the magnitude 


AK dw, dw, + + + dw, a{U}. (17.6) 


On integrating with respect to d{[U] over [u] we obtain the 
theorem, already ‘applied in the preceding section, concerning 
the magnitude of that portion of u in which the angles of rotation 
have values lying between w, and w, + dw,. 

These considerations remain valid on restricting ourselves 
to the group u of unitary transformations with determinant 1. 
The angles of rotation are then subjected to the restriction 


@ + wet: + * +o, = 0, (17.7) 


and the only difference in the result is that the factor dw, in 
(17.6) is to be omitted. Condition (17.7) allows us to normalize 
the linear form h,w, +: - - + h,w,, in the angles of rotation in 
such a way that h, = 0; the exponents (h,, he, + + -, A,) in the 
weights of the representations of u are then non-negative integers. 
It is desirable, however, not to impose this normalization h, = 0; 
we need then only to remark that only the differences between 
the h; are of significance: the irreducible representations 
D(/, fe, °° *, fx) of u are unchanged on increasing each of the f; 
by the same integer. In particular, these considerations justify 
the expression used in Chapter III for the volume on the group 
manifold of the unimodular unitary group U,, and the results 
of the preceding section constitute a direct proof, which is inde- 
pendent of the completeness theorem, of the fact that the 
representations of u, denoted by ©, constitute a complete set of 
inequivalent irreducible representations of Ug. 
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§ 18. Branching Laws 


Finally, we show the usefulness of our formule for the 
characters by deriving two simple ** branching laws ” from them. 


1, Branching law for the Permutation Group. 


The irreducible representation of m; with the symmetry pattern 
PU S33 ee) reduces, On restricting a, lo the sub-group Tyo of 
permutations of f— 1 things, into the sum of those irreducible 
representations of m;_, associated with the patterns 


Pepe dy faa oo es 
Pee = Wifey? 22) 


. ee @ @e@ @ @ @®  @e@ @ $e @ @ © 


those patterns 1n which the rows are not arranged in decreasing 
length are to be omitted. [Each such constituent appears exactly 
once. (In words, these patterns are obtained from the original 
one by removing a field in turn from the end of each row which 
is actually longer than the following one.) 

Proof. Let s be a permutation of the numbers 1, 2, -° -, 
f— 1 belonging to the class (2; — 1, 72, 23, : “+). Considered as a 
permutation of the fnumbers 1, 2,-+-, f, s leaves the last number 
fixed; the number of one-term cycles 1s thus increased by 1, 
and s, considered as an element of z,, belongs to the class 
(21, 12, 23,°°°). In the expansion 


A-ot loge s+ = Say yw, E1! eft ss - (18.1) 


we have as the coefficients of those terms for which 
es vere 


Anny =O OF Xpazp,... (5) (18.2) 


— 


actually = or not. xy is the primitive character of a,_, belong- 
ing to the symmetry pattern P(f;, fo,-+°). On the other hand, 
the coefficient of ej' 632 -- > (hy > Ap > ++ -] inA-ofof:-- is 
equal to the character yyz,,...(s) of the representation of my 
with pattern P(f;, f2, °° +). Hence on multiplying (18.1) with 
o,=& +6, +°::++ 6, we find 


Xfifers: (s) oe Any—1, he, hs, 0+ = ap, hyg—1, hs, + °° + o> 


Our branching law follows from this result and (18.2). The 
branching law leads to a recurrence formula for the dimension- 


alities gh, te, ia =): 


according as any of the signs S in the above inequalities is 
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2. Branching law for ¢,,. 


~ 


On restricting Cy lo the sub-group of linear transformations of 
an (n — 1)-dimensional sub-space the irreducible representation 
(ft, foo °° *) Of Cy reduces into the sum of all those representations 
(fi, for + + +) Of Cua for which 


h2h2h2h='+: 2h =hnii (18.3) 


each of these constituents appears exactly once. 

Proof. The linear transformations S of the sub-space ¢,_,: 
x, = 0 are simply isomorphic to those linear transformations 
S of the variables x,, %., : °°, X, in which x, 4%,. Hence eg, 


is to be replaced by 1 in the characteristic (16.9). The denom- 
inator is then 


Dey, Cay egg) epee es eed) een 1), 


as can be seen by subtracting the last column of D(e,, €9, - °°, 
€,_,, 1) from each of the previous ones and factoring the resulting 
(n — 1)-row determinant. In order to divide the determinant 
in the numerator by the factor (€, — l)(¢. — 1) °°: (€,_, — 1) 
we subtract the second column from the first, the third from the 
second, : + -, and finally the 2" from the (x — 1). The last 
row then is 0, 0,:-+-, 0, 1; the determinant is thus reduced to 
a determinant of order (x — 1). Now divide each element in 
the vt® row by e, — 1 in accordance with 


h h 
ea Ee": 
7 1 — g’y-] +- e e e +- chs. 


The result is that we then have in the numerator the determinant 
le sn: 96 er ih gh gha- 4+. e 4 chs a | 
(€ = &, &y,° * *, Ena). 
But this is the sum of all (n — 1)-rowed determinants of the form 
je™, gh a coe eM n-a| 
hi> hy Shae > hy 2hg> hy Sh, (18.4) 


On subtracting » — 1 from hy, n — 2 from hy and hy, +: +, 0 


from h,_, and h,, in order to obtain the numbers f [(16.7)], che 


inequalities (18.4) become the inequalities (18.3) and our theorem 
is proved, 
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Proof of an Inequality 
(Page 77.) 


In order to prove the inequality stated on page 77 we must 
show that any continuous and differentiable function %, which 
is defined for all values of the real variable x, satisfies the 
condition 


1 [aucx)” {pyar Va ag oF ae, (* 


provided, of course, that the integrals involved actually exist. 
The Schwarz inequality 


\a,), ae eee Anda? as (a,a, 7 Gale ee A,dy)(d,b, = ann b,.D,) 


employed in Chapter I becomes, on replacing the sums by 1n- 
tegrals—or rather each sum by two integrals— 


| fugidx a | fogedx|? S (\fifdx ig | fafedx)( | giBidx a { go82dx). 
Applying this inequality to 


vWf) = op 4 xg 
by taking 


fh = xp, fp = xf, gy = 4 g= 


and transforming the integral 
{ x (Wp) dx ie | pbde 


by partial integration over the range —oo, +00, we obtain the 

desired relation (*) provided the term seabp, which is integrated 

out, approaches 0 as x -> +00. That this 1s actually the case 

if the two integrals on the right of (*) converge can be seen by 

the following indirect proof. Let ¢ be any pre-assigned positive 
393 
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constant and consider a positive value of x for which x | p(x) |? >e 
fe @) 


2 

and which 1s so large that | = dx S - The Schwarz inequality 
(Hal <i — 2. (YP 
(ea S (x’ — x) \Z ax 


then tells us that forx Sx’ Sx+ - 


— 


S 


oe’) — Wa)? <3 -£, whence [Y(x")| 2 [vn| — A E>4V2 


The integral of x? | 2 over the range from x to x + = is then 


Ss x2 . 1 e ° = — e° 
4x x 4° 
Hence it follows that conversely 


1 c 2 
_€ 
pelvis 


z Eo 


2 
ss 


imply the inequality 
x | p(x)|? Se. 


APPENDIX 2 
A Composition Property of Group Characters 
(Page 169.) 


Tue fundamental property of the irreducible representation 
§) : s > U(s) which is expressed in the equation 


U(st) = U(s)U (i) 
is paralleled by the relation 


x(s)x(d) = ZEx(or te), (*) 


Proof. Jf x, y are two elements of the algebra of the group, 
the second of which belongs to the central, and if 


x>xX, yo Y in §, 


then dade The matrix associated with z= xy in 1s 


1x and its trace 1s on : 
g g 


Ea(r)x(r) = 
On setting 


we find 


] 
Eels) vid) x(st) = als) ¥(0) x(6) x10. 
Since y(t) depends only on the class of conjugate elements to 
which ¢ belongs we may replace 


x(st) by 5 Ex(s4r) 


on the left-hand side of the previous equation. Then the co- 

efficient of x(s)y(t) on either side of the equation depends only 

on the class to which the element ¢ belongs, and since x(s) 1s an 
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arbitrary function, y(t) an arbitrary class function, the assertion 
(*) follows from the fact that the two coefficients must agree. 
We have omitted mention of this equation (*) in the text 
in order not to interrupt the systematic development of the 
theory of representations, which is completely described by the 
orthogonality relations and the completeness theorem. 


APPENDIX 3 


A Theorem Concerning Non-degenerate Anti- 
symmetric Bilinear Forms 


(Page 274.) 


We consider the given non-degenerate anti-symmetric bi-linear 
form 

, 

oe rk Xi Ve (Cee == — Cox) 

ee ee | 

as the “anti-symmetric product” [ry] of the two vectors 
L= (X, %2,° °°, Xr) and Y = (y,). Let e, be any non-vanishing 
vector; then by assumption [e,z] cannot vanish identically in 
r, and consequently a second vector e, can be found such that 
[exe] == 1. The simultaneous equations 


[eit] cae Q, [¢ ox | ==‘ 


then have f — 2 linearly independent solutions e@3, ° ++, ey. These 
vectors are furthermore such that no linear dependence can 
exist between them and ¢y, ¢s, for if 


Yas Ey + Soa + Eacy tose + Sey = O, 


it follows on building the anti-symmetric products [e,r] = &, 
lear} == — €, that &€, = €& == 0. We may therefore choose 
Cy, Co, °° *, Cy aS a co-ordinate system, 1c. as a basis from which 
all vectors may be constructed. Let the anti-symmcetric pro- 
duct be expressed in terms of the components &,, 9, of z, Y in 
this new co-ordinate system by 


f 
[ry] = Ain Es Ne 


The manner in which the new fundamental vectors were deter- 
mined requires that of the coefficients y,;, = [e,€] 


Vaya 0, Vie l ‘ Yi3 > 0, ep EL 0, 
Yo, = —l, Yo. = 0; Yo = O°, Yor = 0. 
397 


398 APPENDIX 3 


In consequence of the anti-symmetry all y,,, y,. with 7 = 3, ---, f 
vanish, and the matrix of the y,, is completely reduced into the 


2-rowed square sub-matrix 
0 1 
—l1 0 


and an (f — 2)-dimensional anti-symmetric matrix. Mathe- 
matical induction with respect to the dimensionality f yields the 
desired theorem that f is necessarily even and that the original 
form can be transformed into 


(E12 = E571) i (E374 == E43) = ee (f/2 terms) 


by an appropriate linear transformation. 
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algebra 167. 
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contragredient matrix 123, representation 123. 

equivalent as correspondences of the ray field 21. 
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FE, energy level 44. 
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f,  4-vector potential multiplied by e/ch 214. 


fxg curl of f, (= dfs dfe <8) 216. 
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F action of the electro-magnetic field 215. 
F(t,, tg, . . -, ty) tensor 139, 281. 
g dimensionality of a group representation 120, Landé g- 
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h Planck’s quantum of action divided by 27 51, order of a 
finite group 118. 

H energy 51. 
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. . terms 1 = 0, 1, 2, 3, 4, 
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presentation 321, 350; (= p») mass of the electron. 
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M, M’ action of the material field 211. 

(M,, M,, M,) = M total moment of momentum 179, 187. 

n dimensionality of a vector space 1; principal quantum 
number 69. 
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P symmetry pattern 358. 

(Gx) Iv, 9z) = q electric dipole moment 83. 

distance from centre. 

element of a group; spin quantum number 206. 

(Sz, Sy, Sz) = 8 electric current density 218, s* charge-current 
4-vector 214. 

(S,, Sy, S,) = © spin 178, 203. 
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1 0 0 | 0 -—1 ] 0 
= ye = oo 148. 
0 1 1 1 0 40 — 1 
t energy-momentum tensor 218. 


T interchange of %,, %, and ¢,’, p,’ 149. 

v valence 369. 

W perturbation energy 86, total action 216. 

Xq XyX_X_ ort x y aco-ordinates of space time (t = x, 98, or ct = Xp 
211). 


GERMAN. (For 3-dimensional vectors see their components 
under Latin letters.) 
¢ =, group of (unimodular) linear transformations in ” dimen- 


sions 128. 

(c)f representation of ¢ whose substratum is the tensors of 
order f 125. 

©, = D,(v = 27) representation of vth degree of ¢, or Up ~ D3 
128, 142. 


D, orthogonal group in dimensions 142; same but in- 
cluding improper rotations 143. 

D(™ 1-dimensional representation of rotation group D, 141. 

Q1, Co, . - -, @, co-ordinate system in vector space 2. 

( unitary representation of the rotation group induced in 
the function space of p(x y 2) 143. 

q abstract group 114. 

{, conjugation 118. 

M mean value 158. 

yt representation of the rotation group induced in system 
space 187. 

p, J invariant sub-space of rt, Rf respectively 287, 282. 


n 
v an algebra considered as a vector space 286, fy = 1 = i RY 
290, 350. 
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R vector space |, R/ corresponding space of tensors of order f, 
[R/] space of the symmetric tensors, {Rf} space of the 
anti-symmetric tensors, 239, 242. 

R,, Ra system space of clectron translation, spin 196. 

t,  left-translation 116. 

uw = u, (unimodular) unitary group in ” dimensions 139. 

% ray representation giving rise to algebra of complex 
quaternions 182. 

t vector in ” dimensional vector space 1. 


GREEK 


a == e*/ch fine structure constant 216. 
d,, Kronecker symbol = 1 or 0 according asi = k ori + k 17. 
+e 


0(x) Dirac 6-function (= 0 except for x = 0 and J 8(x)dx == 1} 
255. —e 
5, = +1 according as s is an even or an odd permutation 121. 
) signature 201. 
he 2 aie 52 
ae ae oy Tv 3 aplace s operator a 92. , 


ry) 
v= PH Si — 212. 
€ generating element of a right- and left-invariant sub-space 
311. 


6,4 polar co-ordinates 60. 
p.(= m) mass of the electron. 
v frequency 50. 

el 
ae PATE 
7 = m7, Symmetric group of permutations of f objects 121. 
p electric charge density 218, an algebra 304. 
gd, electro-magnetic 4-vector potential 98. 

vector defining the state of the material field 49. 
x, X group characteristics, 150, 151. 
w angle of rotation 151. 


, Larmor factor—unit of Zeeman separation. 


INDEX 
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Abelian group 118, its unitary irreduc- 
ible representations 140, in ray space 


182, quantum kinematics as A. g. 
of rotations 272 ff. A. system of 
forms 26. 
Absorption of photon 44, quantum 
theory of a. 107, 224, 261, a. lines 45. 
Action of material field 211, of electro- 


magnetic field 215, total 216, 222. 
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ences 6, of matrices 7, of repre- 
sentations 126, of elements of an 
algebra 105, 303, of numbers of a 
field 302, direct sum of algebras 311. 
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112. 
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66, 181, 286, simple 311, 31} 
semi-simple 316, order of a. 304, 
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basal units 168, 304, division a. 
(== field) 304, 316, central of a. 167, 
311, invariant sub-a. 167, 280, 
generating unit of s.-a. 168, 291, 
direct sum $11, direct product 333, 
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304 ff, regular representation 289, 
complete reduction of representation 
306 ; — a. of complex quaternions 182, 
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its enveloping a. 284, reduction of 
a. of linear transformations 307 ff. 


doublets 
Zeeman 


Alkali spectrum 85, 86, 202, 
in 204, with anomalous 
effect 205. 


Alkaline earth spectrum 207, 246. 
Alternation 358. 


Alternation law 207, 370. 


Atom, Rutherford’s model xiii, Bohr’s 
theory of a. 43, radiation on classical 
and Bohr theories 44, on quantum 
theory 104 ff., 256 ff., Hund’s vector 
model of a. 191, 244; see Spectrum. 


Automorphism 115, automorphic corre- 
spondence of group 134, 


Auxiliary quantum number, see under 
Quantum number. 


Azimuthal quantum number, see under 
Quantum number. 


Balmer 45. 


Bessel’s inequality 33, 
representations 169. 


for system of 


Black body radiation 41, 104, 256. 
Bohr, H. 39. 

Bohr magneton 66, 205. 

Bohr, N. xiii, 43, 95, 105, 236, 245. 
Boltzmann 108, 

Born 48, 74. 

Bose 50. 

Bounded Hermitian form 39. 
Brackett 46. 


Branching rule, for spectra 207, for 
linear and permutation groups 390 ff. 


de Broglie, L. 48, 53, 211, 220. 


Burnside’s theorem 153. 


Canonical variable §2, 
formation 96. in quantum mechanics 
98, c. aggregate 79, c. basis for 
rotations in ray space 274. 


of algebra 167, 


94, c. trans- 


Central, of group 118, 
313. 
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presentation 156, primitive c. 150, 
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perties 156, 159 ff., 317. For char- 
acters of special groups see under 
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Character of element of algebra 295. 


Characteristic number of Hermitian form 
or operator 21, 35, of unitary form 26, 
multiplicity of c. n. 22, 26, of energy 
56, 80 ; — characteristic vector or func- 
tion 21, 35, of wave equation §6, 80; 
—c. space 22, of energy 80, 192, of 
moment of momentum 189, 192. 


Class of conjugate elements 118, in 
symmetric permutation group 328; 
— c. function 150, 156, as element in 
central of group algebra 169. 


Classical mechanics compared with 
quantum mechanics xiii, 73, 81, 94, 


190, ‘‘c.’? combination principle 47, 
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Clebsch-Gordan series 128, 163, 190, 


371, aS quantum rule for composi- 
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Closed shell 86, 245. 
Cogredient transformation §. 
Collision phenomena 46, 70 ff. 


principle, Ritz-Rydberg 
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Combination 
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274, interpretation of 275, wave 
equation derived from c. r. 277 ff., 
c. r. for infinitesimal rotations 178, 
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Pauli exclusion principle 244, method 
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Compton effect 224. 
Condon 74. 
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Conservation law, for electricity 214 ff, 
energy 82, 218, 220, momentum 218, 
220, moment of momentum 188, 
221, Dhirac’s c. l. 227, of quantum 


held 264 ff. 


Contact transformations 96. 


Contragredient transformation 12, re- 


presentation 123. 
Contravariant vector 13. 
Convex region 79. 


Co-ordinate system, in vector space 2, 
adapted to sub-space 3, transforma- 
tion of c. s. 4, normal c. s. 16, 
21, Heisenberg’s c. s. 80, in special 
relativity 147, in general relativity 
219. 

Correspondence or _ transformation, 
general 110, identical 110, inverse 


iI, product III, isomorphic 112, 
automorphic 134, similarity 283; — 


linear 65 ff., 21, = projection 282, 
in function space 35, trace 11, 1 5° 
dual 18, 123, contragredient 12, 


Hermitian 18, unitary 16, 
imal unitary 28 ff., rotation of ray 
space 20, X-multiplication 90,  re- 
duction and complete reduction 9, 
irreducible system of I. c. 122, 153 ff., 
symmetric c. in tensor space 282, 
For special groups of correspondences 
see under qualifyin;: adjective. 


infinites- 


INDEX 


Correspondence principle 95. 


Coupling, Russell-Saunders or (s/) 206, 
(77) 206. 


Courant 4o. 


Covariant linear quantity 173, in 
quantum mechanics 197; — c. vector 
13. 

Cycle of a permutation 328. 


Cyclic group 117. 


Davisson 50, 53, 70. 


Decomposition, see Complete reduction, 
of space 3, 122, of dual space 14, 
in unitary geometry 18, into char- 
acteristic spaces, 22. 


Degenerate 
of 86, 


Degree of a representation 120. 


system 83, perturbation 
accidental degeneracy 192. 


§-function 36, 255. 
Derivative of operator 94. 


Dimen>ionality of space 2, 3, 
representation 120, 


of a 


Dirac 109, 210, 211, 217, 225, 255, 260, 
262, 357. 


Dirac’s relativistically invariant equa- 
tions for electron 213, 218, 225, in 
central field 227 ff., quantization of 
253 ff.; — D. theory of proton 262 


Directional quantization 67, 75, 205. 
Dispersion 53, 224. 
Division algebra (== field) 304, 316. 
Double tensor 347. 


Dual space 12, matrix 13, system of 
transformations 123, symmetry ele- 
ment and representation 352, sym- 
metry pattern, 361, 369. 


Dynamical variable, represented by 
Hermitian form 74, 275, measure- 
ment of 74 ff., mean value or ex- 
ara 75, intensity on transition 
3, 197, composition 91, totality of 
d.v. represented by irreducible system 
238; — d. law 54, 80 ff., 97, 187, 266. 


Dynamically independent systems 92. 


Effective quantum number, see under 
Quantum number. 


Einstein 42, 50. 


Electric charge, atomicity of 216, post- 
tive and negative 262, e. c. density 
and current density 215, conserva- 
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tion of e. c. 214, 217, e. dipole moment 
83, 104, 197. 


Electro-magnetic field, effect on charged 
particle 98, 213, 222, interaction with 
matter 105, 261, equations of 102, 


218, quantization 104, 253, action 
215. 


Electron, de Broglie’s equation for e. 
53, Schrodinger’s §4, 111, Dirac’s 
213, e. beams 50, spin 195, 196, 
203, 276, translation 196, in spher- 
ically symmetric field 63, 227, nega- 
tive energy levels and ‘‘ positive e.” 
225, existence vs. constitution of e. 
261, e. and proton 262. 


Element, of group 114, of group alge- 


bra 166, of algebra 303, idem- 
potent e. 168, 291, independent 292, 
primitive 293, real 295, trace 299, 
317, scalar product 299, character 
of ane. 295. 

Elsasser 74. 

Emission, of photon 44, quantum 
theory of e. and absorption 107, 224, 
261, spontaneous 107, stimulated 
108. 


Energy, and its operator 51 ff., 80 ff, 
97, 187, 215, e. level 44, 50, in 
collision phenomena 70, in perturba- 
tion theory 86 ff., on composition 92, 
in electro-magnetic field ror, with 
spin 215, 220, e. of radiation field 
103, 258, e. of simple state 189, 101, 
of system of equivalent individuals 
320 ff., 3560, of molecule 346, ex- 
change e. 322, 342, 346, e. and 
momentum 51, 218, 220, conserva- 
tion 188, zero-point e. 104, 258, 261, 
inertia of e, 221, e. quantum 41. 


Enveloping algebra 284, for double ten- 
sors 348. 


Equality, axioms of 112. 
Equivalence degeneracy 239 ff., 320. 


Equivalent individuals, state of system 
consisting of e. 1. 239 ff., energy 241, 
320 ff., 356, quantization 246. 


Equivalent systems of linear transforma- 


tions 121, e. representations 120, 
sub-spaces 135, 283, e. points with 
respect to transformation 112, e. 


elements with respect to sub-group 
118, 


Euclidean geometry 15, 112. 
Exchange energy 322, 342, 346. 


Expectation or mean value of physical 
quantity 75, 78, 92. 


Exponential function 28, of matmx, 29. 
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Factor group 119, 132. 
Faithful realization 114. 
Ferro-magnetism 347. 


Field equations, for electro-magnetic 
field 102, 218, for matter 213 ff., 
their quantization 104 ff., 253 ff. 


Field, number f. 294, 302, algebraically 
closed 294, commutative 302, finite 
f. of modulus ~ 303; — ray f. 20, 
vector f. 20, point-f. L110. 


Fine structure, in hydrogen 203, 
f, s. constant 216. 


236 


bi-linear 13, 16, 18, 
unitary 16, commu- 


Form, linear 12, 
Hermitian 18, 


tator 273, anti-symmetric _ bi-linear 
273, 397. 
Fourier coefficient 33, series 33, in- 


tegral 39, F. c. or group matrix for 
representation 165. 


Franck 46, 70, 74. 


Frequency 50, Bohr’s f. rule 47, 105, 


109. 
Frobenius 156, 358, 383. 


Function space 32, of quadratically 
integrable functions 143. 


Galois, 132. 
I’-process 126. 
Gamow 74. 


Gauge invariance 100, 213, 220, rela- 
tion to conservation of electricity 214, 
217, role in quantization 256, 271. 


Generating function of infinitesimal 
canonical transformation 97. 


independent 292, 


Generating unit 291, 
of 


in field of complex numbers 295, 
symmetry class of tensors 296. 


Geometry, affine or vector 1 ff., 112, 
Euclidean 15, 112, unitary 15 ff., 
characterized by group I12. 


Gerlach 65, 75. 

Germer 50, 53, 70. 

g-factor, Landé, 204, 205, 207. 
Goudsmit 203. 


Group lioff, transformations g. 111, 
abstract 114 ff., isomorphic 115, 
automorphic correspondence of g. 
115, 134, commutative or Abelian 
118, cvclic 117, order of finite g. 
118, of element of g. 117, central 


INDEX 


118, sub-g. 116, index of sub-g. 
118, self-conjugate or invariant sub-g. 
119, 132. factor g. 119, simple 135, 
direct product 127, closed continuous 
160 ff., Lie theory of continuous g. 
175 ff., g. manifold 160ff., invariant 
sub-space of g. manifold 291 ; — realiz- 
ation of g. 114, representation of g 
120, of sub-g. 127, 334, of direct 
product 333, g. matrix 165, algebra 
of g. 166, 181, 286. For special 
groups, see under qualifying adjectives. 


Gurney 74. 


Gyro-magnetic effect 205. 


Hallwachs 42. 
Hamilton 50, 138. 


in classical 
‘ in quantum mech- 
in quantum field theorv 253. 


Hamiltonian equations, 
mechanics 96, a8, 
anics 94, 


Heisenberg xiii, 48, 80, 82, 222, 264, 
347. 


Ifeisenberg’s co-ordinate system 80. 


Heisenberg- Pauli theory of the quantum 


field 253 ff. 
Heitler 342. 
Hellinger 39, 40. 
Hermite 18. 


Hermitian form or operator 18, non- 
degenerate 18, positive definite 18, 
unit 15, idempotent 23, in function 


space 35, 37, bounded 39, product 
of H. f. 20,- trace 20, characteristic 
number 21, 35, transformation to 


principal axes for single H. f. 21 ff., 
32, for Abelian system 25; — H. f. 
represents physical quantity 74, 275, 
characteristizes statistical aggregate 
79, 239; -— H. conjugate 17. 


Hermitian polynomials 57 ff. 
Hertz, G. 46, 70, 74. 
Hertz, H. 42. 

Hilbert 39. 

Hilbert space 32. 


Hund’s vector model of the atom 191, 
244. 

Hydrogen atom 45, on Schrodinger’s 
theory 63 ff., on Dirac’s theory 234 ff, 
spectrum 45, 69, fine structure 203, 
230. 


INDEX 


Idempotent Hermitian form 23, 
independent 233 — 
algebra 168, 291, 
primitive 293. 


37> 
i. elernent of an 


independent 292, 


Identity correspondence 6, 110, repre- 
sentation 121. 


Independent, linearly i. vectors 2, 1. 
idempotent forms 23, idempotent 
elements of algebra 292. 

Index of sub-group 118. 


Infinitesimal unitary transformation 28 ff., 
rotation 27 ff., moment of momentum 
induced by i. r. 178, canonical trans- 
formation 96, element of continuous 
group 160, 177. 


Inner quantum number, under 


Quantum number. 


S€€ 


Intensity, as measure of probability 49, 
i. of dynamical variable on transition 
83, 197, of spectral lines 44, 83, 232, 
in anomalous Zeeman effect 201. 


Interaction between matter and radia- 
tion 104 ff., 261. 


Interchange, of right and left 225, of 
past and future 109, 227, 263. 

Invariance, in special relativity, dif- 
ficulty for quantum mechanics 54, 
Dirac’s treatment 210 ff., 1. of 


quantum field equations 268 ff.; — in 
sense of general relativity 219, under 
change of gauge 100, see Gauge 


invariance. 
Invariant of transformation group I17, 
170, in representation space 171. 


classical theory 170 ff. 


Invariant sub-space 8, under system 


of transformations 122, 135, 282, 
left-1. s.-s. in group space 289 ff., _ left- 
and right-i. s.-s. 168, 311, in tensor 


space 296 ff., significance in quantum 
theory 320; —-i. sub-group. 119, 
maximal 132. 


Inverse correspondence 6, 111, elernent 


of group I14. 
Involution 13. 
Ionization potential 46. 


Irreducible invariant sub-space 122, 282, 
systern of linear transformations, re- 
presentation 122, reduction into i. 
constituents 122, 135 ; — irreducibility 
= complete irreducibility in unitary 
domain 136, 292, 301, for reducible 
algebra 305. for algebra of trans- 
formations in completely reducible 
vector space, 307. 
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Isomorphic correspondences 112 


simply isomorphic groups 115. 


Jeans 42, 102, 103 

(77) coupling 206. 
Jordan-Ho6lder theorem 131 ff. 
Jordan, P. 261, 280. 


Kinematically independent systems 92, 
190, perturbation of 93. 


Kinematics of system determines repre- 
sentation in system space 189, 
Heisenberg’s quantum k. 94 ff., as 
Abelian group of rotations 272 ff, 
in second quantization 250, k. of 
spin 195, 203, 276. 

Klein’s Erlanger programme xv, I12. 


Laguerre polynomials 70. 
Landé, 204, 208. 
Laporte’s rule 201, 203. 


Legendre polynomials and _ associated 
functions 62, with spin 230. 


Lenard 42. 
Leonardo da Vinci 112. 
Lie 176. 


Light, wave and corpuscular nature of 
48 ff., 53. 


Linear, |. algebra 303, see Algebra ; —- 
]. correspondence 5, see under Corre- 
spondence; — |. form 12, 1. covariant 
quantity 173, 1. projection = 1. cor- 
respondence 282, |. sub-space 2; — 
]. momentum, see Momentum, linear. 


Linear group, complete ¢, 123, simplest 
representations 123 ff., representa- 
tion Gy of ¢, 128 ff., its ir- 
reducibility 299, representation (Gy, 
131, 164 ; — reduction of (c)f equivalent 
to reduction of algebra of symmetric 
transformations 284 ff., unitary re- 


striction immaterial 285, result of 
the reduction 301, characteristics 
335 ff., relation to characters of 


symmetric permutation group 326- 
representations of order f 309, 
branching law 391. 


London 342, 346, 370. 
Lorentz group, restricted, obtained from 


t, 147 ff., complete L. g. obtained on 
adding reflection 147, positive and 
negative transformations 147, and 


Dirac’s equations 212 ff., transforma- 
tion induced in system space 268 ff. 


Lyman 45. 
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Magnetic quantum number, see under 
Quantum number. 

Magneto-mechanical anomaly 205. 

Magneton, Bohr 66, 205. 

Magnitude, absolute, of vector 16, 19. 


Mapping _ IIo, 
Transformation. 


see Correspondence, 


Matric algebra, ‘simple 168, 313. 


Matrix 7, dual or transposed 13, unit 
6, addition 7, multiplication 8, re- 
duced and completely reduced 49, 
transformation of m. 8, norm I1, 
trace 11;—gronp m. 165. 


Maxwell’s equations 102, 218, quan- 
tization of 104 ff., 253, M. action 215. 


Mean value or expectation of physical 
quantity in pure state 75, 78, 92, in 
mixed case 79; — m. v. over group 
manifold 158. 


Measurement of dynamical variable 74 ff 
Metric 15. 

Millikan 42, 245. 

Minkowski, H. 79. 

Mixed state 79. 


Modulus, of algebra 168, 304, reduc- 
tion of 168, 301 ; — of finite field 303, 


Molecule, spectrum 191, perturbation 
theory and constitution 339 ff., non- 
polar bond 342, London formula 
for binding energy 346, on taking 
account of Coulomb forces 356, val- 
ence theory 369 ff. 


Moment of momentum of a representa- 
tion 179, of D; 179; — m. of m. of phy- 
sical system 187, orbital 64, 195, 
spin 195, 203, 218, behaviour on 
composition 190, conservation 188, 
219 ff., 227, reduction of system 
space with respect to m. of m. 192, 
induced by infinitesimal rotations of 
Lorentz transformations 185, 269. 


Momentum, linear, and its operator 51, 
220, conservation of energy and m. 
218, 264 ff. 


Moseley’s law 69. 
Motions, geometrical 111, 


Multiplet 196, 206, 373, as _ relativis- 
tic phenomenon 204, 234, normal 
Zeeman effect 101, 193, 198, anom- 
alous Zeeman effect 204, 208 ff., 
alkali doublets 204, singlets and 
triplets in alkaline earths 207, 246, 


group of 176. 


INDEX 


multiplicity 321, 350, under Pauli 
exclusion principle 352, in 2-dimen- 
sional spin 35 *» 369, multiplicity and 
valence 369 ff., branching rule and 
alternation law 207, 370. 


Multiplication, of vector by number 1, 
of correspondences and matrices 6 ff., 
of numbers of field 302, of elements 
of algebra 165, 303, quaternion m. 
138, outer or X-m. of spaces, vectors, 
operators 90, 125, of representations 
126, direct product of groups 127, 
333, of algebras 333, X-m.of repre- 
sentations 127 ; — scalar m. of vectors 
16, of elements of an algebra 299, 317. 


v. Neumann 4o, 78. 


Noether, E. 134. 


in rel- 
state of atom 45, 


Normal co-ordinate system 16, 
ativity 147, n. 
n. term order 206. 


Number, of field 302, operations on 302; 
— characteristics n. 21. 


Operator = linear correspondence 
Hermitian 18, in function space 
representing dynamical variable 
considered as»function of time 
derivative of 0. 94. 


6, 
35; 


55; 
81, 


Orbit, in older quantum theory 
orbital moment of momentum 


195. 


Order, of finite group 118, of element 
of group 117, of sub-group 118, 
of finite algebra 303. 


Orthogonal group, see Rotation group ; 
— o. transformation 16, 0. vectors 16. 


47, 
64, 


Orthogonality relations 32, for group 
characters 159 ff,, 317, for syms 
metric permutation group 367. 


Oscillator 43, 56 ff., 84, black body 
radiation as system of o. 102 ff., 258, 
quantum mechanical laws of system 
of o. 249. 


Parseval’s equation 33, 35, 162. 
Paschen 45, 236. 
Paschen- Back effect 208. 


Pattern, see 


pattern. 


Pauli 77, 203, 211, 244, 204, 347, 351. 


Pauli exclusion principle 207, 244 ff., 
and reduction of algebra of sym- 
metric transformations 281, 323, 347 ff., 


355, 370 ff. 


symmetry, Symmetry 


INDEX 


Peirce reduction 312. 


Periodic system of the elements 60, 
242 ff. 


Permutation 11, reduction into cycles 
328, conjugate 328, as operator on 
tensor 281. 


Permutation group, symmetric 121, 
classes 328, elements as symmetry 
operators 286, relation to symmetry 
class of tensors 286 ff., for arbitrary 
p. g. 332, characters 320, 383 ff, 
relation to characteristics of unitary 
group 331, use of characters to 
calculate exchange energies 322 ff,, 
energy of non-polar bond 346, ex- 
plicit theory of representations 358 ff., 
reciprocity theorems 339, branching 
law 390. 


Perturbation theory 86 ff, 
matically independent systems 93, 
for equivalent individuals 321 ff., 
for molecules 339 ff.; — p. energy 86, 
for axially symmetric field 192, for 
magnetic field 101, 193, 204, 224, 


for kine- 


for electric field 101, 224, spin p. 196, 
in Dirac theory 224, determines 
transition probability 89. 

Pfund 46. 


Photo-electric effect 42. 

Photon 42, 49, 54, 104, 248, 258, 261. 
Planck xiii, 41. 

Planck’s radiation law 41, 108. 
Point-field rro. 


Polynomial, characteristic 11, 22 ; — Her- 


mitian 57 ff., Legendre 62, with 
spin 230, Laguerre 70. 
Primitive unit 293, character 150, 


symmetry class 358. 


Principal unit of algebra 168, 304; 
—- p. transformation 128, transforma- 
tion of Hermitian forms to p. axes 21, 
25, 32, 39, for unitary forms 26, 39; 
—- p. quantum number, see under 
Quantum number. 


Probability, relation to intensity 49, 
that a dynamical variable assume a 
given value in a pure state 75, ina 
mixed state 79, p. density and current 
density 50, 215, 217 ; — transition p. 73, 
83, 89, in composite system 90, 93, 
for an atom in radiation field 106 ff. 


Product, see Multiplication. 


Projection, with respect to sub-space 4, 
in unitary geometry 18, orthogonal 
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and unitary-orthogonal 23, linear 


p. = linear correspondence 282. 
Proton, Dirac’s theory of 262. 


Pure state 75, conditions for 77. 


Quantization, 


in the older quantum 
theory 47, 


in Schrodinger’s theory 
51, 56, in Heisenberg’s 93 ff., of 
composite system 89, of  electro- 
magnetic field 104, 253, second 246, 
of Maxwell-Dirac field equations 
253 ff. ; — directional or space q. 67, 75, 
205. 


Quantum, of action 41, 51, of energy 41. 


(Quantum kinematics, Heisenberg’s 94 f., 
as Abelian group of rotations 272 ff., 
in second quantization 250. 


Quantum mechanics, 
74 ff., dynamical law 54, 80, 97, 187, 
266, composition 91, Heisenberg’s 
formulation 93, Schrddinger’s equa- 
tion 54, 101, Diurac’s equations 213, 
218, Heisenberg- Pauli q. m. of wave 


fields 253 ff. 


Quantum number, auxiliary (4) 228, 
selection rules 233, relation to azi- 
muthal and inner q. n. 228, 233; — 
azimuthal gq. n. (4, }) 64 ff., 142, 196, 
determines orbital moment of mo- 
mentum 65, 196, selection rules 84, 
201, On composition 194, 207, 373, 
relation to auxiliary q. n. 228, 233; 
— inner q. n. (7, /) 189, 196, deter- 
mines total moment of momentum 
179, 189, behaviour on composi- 
tion 190, 194, 206, selection rules 
198, relation to auxiliary q. n. 228, 233; 
— magnetic (m) 64, 193, determines 
z-component of moment of momentum 
65, 180, 189, selection rules 85, 198, 
of spin and of orbital moment of 
momentum 209, in Dirac’s_ theory 
232; — principal or total (#) in hydro- 
gen 69, in hydrogen-like spectra 85, 
has no group-theoretic significance 
144, true 86, 243, effective 243; — 
radial 64, 144; — spin (s) 206,  re- 
lation to valence 369. 


general scheme 


Quantum state 43, 56, 80, 188, simple 
189. 


Quaternion 138, complex 182. 


Radial quantum number, see under 
Quantum number. 


Radiation, from atom 44, 83 ff., 105 ff., 
224, field 102 ff., 215, 256 ff., black 
body 41, 104. 
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Ray 4, 20, 
system 75, 
r. field 273, 


Rayleigh 42. 


represents state of physical 
r. field 20, rotations of 
r. representation 181 ff. 


Real element of algebra 167, 
ating unit 295. 


gener- 


Realization of group 114, faithful 114, 
contracted 118, 119, of algebra 166; 
— linearr. = representation 120, see 
Representation, 


Reciprocity theorem, for arbitrary group 
338, for permutation group 339. 


Reduction of correspondences or re- 
presentation 9, 122, uniqueness 136, 
156, complete r. 9, 129, 135 (see 
Complete reduction), sometimes im- 
plies complete r. 18, 123, 136, 292, 301, 
306, 308, of regular representation 
289 ff., 305 ff., of system space of 
equivalent individuals 238 ff.,  anti- 
symmetric r. for electrons 242, 351 ff., 
symmetric r. for photons 248, 351 ff., 
influence on term spectrum 241, 372 ff., 
general treatment without spin 296 ff., 
with spin 347 ff., for symmetric and 
anti-symmetric cases 351 ff 


Reflection, signature induced by r. 143, 
146, 188. 


Regular representation 289, reduction 


305 ff. 

Relativity theory, special 51, 98 ff., 146 ff., 
of quantum mechanics 210 ff., of 
wave fields 268 ff., r. and spin 204, 217, 
222 ff.,; — general 219. 


Representation, of finite group 120, 
of continuous group 160 ff., by ro- 
tations of ray space 181, degree or 
dimensionality 120, character 150, 
complete reduction 122, irreducible 
122, uniqueness of reduction 136, 
156, criterion for irreducibility 159, 
identical 121, equivalent 121, unit- 
ary 136 ff., any r. equivalent to unitary 
r. 157;—— formal processes: addition 
126, x-multiplication 126, 127, x- 
multiplication 127, JI’-process 126, 
r of sub-group 127 ; — of algebra 166, 
304 ff., regular 289 ; — general theory: 
orthogonality properties 157 ff., 317, 
in terms of group algebra 165 ff., 
completeness of system of r. 159, 170, 
318, proved by reduction of regular 
r. 305 ff. For r. of special groups, 
see under qualifying adjective. 

Resonance, between states of same energy 


87, between equivalent individuals 
_ 239 ff., 320. 
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Resonance line 45. 


Ritz-Rvdberg combination principle 44, 
48, 82. 


Rontgen 43. 


Rotation group, in 2-space and its re- 
presentations 140 ff., orthogonality 
of characters 162; in 3-space 
and its representations 142 ff., rela- 
tion to unitary group in 2-space 144, 
augmentation by improper rotations 
143, orthogonality of characteristics 
163, completeness 143, 163, 180, 184, 
389, generated by infinitesimal ele- 
ments 175, representation induced in 
system space 185, 195,372; — in 
m-space 184. 


ee 


Rotation in ray space 21, I81, 273, 
representation by r. of ray field 180, 
quantum kinematics as Abelian group 


of r. 272 ff. 
Rupp 50. 
Russell-Saunders coupling 206. 
Rutherford xii, 74. 
Rydberg number xiii, 45, 69. 
Scalar product, see Multiplication. 


Scalar quantity, commutes with moment 
of momentum and_ signature 188, 
selection rules 197. 


Schrodinger 48, 50, 56, 102, 187, 216, 
220, 258. 


Schrodinger’s equation 54 ff.,_ relativ- 


istic IOI, for system of equivalent 
particles 194, as limiting case of 
Dirac’s 234, derived from  com- 
mutation rules 277 ff. 

Schur, I. 152. 

Schwarz’ inequality 30, 393. 

Second quantization 246, see under 


Quantization. 


Secular equation II, 21, 26, 


in quantum 
theory 88, 209, 344. 


Selection rules 44, 84, 85, for oscillator 
84, for electron without spin 84 ff., 
with spin 232, for scalar quantity 197, 
for vector quantity 197, for auxiliary 
quantum number 233, azimuthal 84, 
201, inner 198, magnetic 85, 198, 
for signature 201. 


Self-conjugate sub-group 119, maxima 
132. 


Semi-simple algebra 316. 


INDEX 


Separation of terms by perturbation 87, 
321, axially symmetric perturbation 
193, in normal Zeeman effect 101, 193, 


198, in anomalous Zeeman effect 204, 
208 ff 

Series, in hydrogen 45, 69, in alkalies 
85, 202. 


Series of composition, see Composition 
series. 


Signature, of representation 143, as 
dynamical variable 188, 203,  selec- 
tion rule 201. 


Simple algebra 311, 313, group 132, 


state 189. 
(s/) coupling 206. 
Smekal-Raman effect 224. 
Sommerfeld 193, 236. 


Space, affine, linear, vector 1 ff., linear 
sub-s. 2, dual 12, unitary 15 ff., 
Hilbert or function 32, 143, reduction 
or deromposition 20, 22, composition 
series 122, 135, product 90, tensor 
125, 281 ff., group s. 115, 160, re- 
presentation 120, 17! ff., algebra as 
vector s. 286, 305, system, see System 
space 


Space quantization 67, 75, 205. 
Span, space spanned by vectors 3, 20. 


Spectrum, atomic, line s. reduced to 
terms. 44, of hydrogen and t-electron 
ions 45, in Schrodinger’s theory 69, 
in Dirac’s theory 234, of alkalies 85 ff., 
doublets 204, of alkaline earths 207, 
246, 3-electron 374, of elements of 
periodic table 206 ff., 242 ; —- general 
theory, without spin 194, with spin 


206 ff., application of Pauli ex- 
clusion principle 242 ff., group- 
theoretic classification 369 ff.,  re- 


duction into term classes 283 ff., 320 ff, 
calculation of term values 320 ff.; 
— molecular Lor ; — of characteristic 
numbers 36, 


Spherical harmonics 60 ff., 84, as basis 
of unitary representation in function 
space 142, with spin 230 ff. 


Spin, electron 195, 196, 203, as relativ- 
istic phenomenon 204, 217, 222 ff., 
Ss. moment of momentum 195, 221, 
magnetic effect 204, 224, s. and 
valence 369 ff. ; — s, perturbation 196, 
203, in Dirac’s theory 222 ff.; -— s. 
quantum number, see under Quantum 
number. 


Stark effect, linear 102. 
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State of a physical system, represented 
by vector or ray in system space 54, 
74 ff., pure 75, 78, mixed 79, of 
total system under-determined 92 ; — 
quantum or stationary 43, 56, 80, 188, 
simple 189. 


Stationary state, see under State. 


Statistical aggregate 78, 239, canonical 


79. 
Statistics, Bose-Einstein 50. 
Stern-Gerlach effect 65, 75, 205. 
Stieltjes integral 37 
Stoner’s rule 243. 


Sub-algebra, left-invariant 289, (left- 
and right-) invariant 167, 311, 314. 


Sub-group 116, 334 ff., cyclic 117, 
index 118,  self-conjugate or invariant 
119, maximal invariant 132. 


Sub-space 2, 32, invariant, under single 
transformation 8, under system of 
transformations 122, equivalent or 
similar 135, 283, see also Invariant 
sub-space. 


Substitution 111, see Correspondence. 


Sum, see Addition ; —s. rule for influence 
of magnetic field, 209. 


Superposition principle 49. 


Symmetric permutation group, see Per- 
mutation group, symmetric. 


Symmetric transformation in tensor space 
282, special 284, Hermitian 283, 
unitary 285, enveloping algebra 284, 
for arbitrary permutation group 332. 


Symmetrization 358. 


Symmetry class of tensors 287, 206, 
primitive 358, of spectral terms 321, 
multiplicity 321, 350 ff., 367. 


Symmetry operator 286, Young’s 359. 


Symmetry pattern 358 ff., dual on trans- 
posed 361, 368, generated by Young 
symmetry operator 359 ff. 


System space for translation 54, 74, 195, 
for spin 195, total 185, 196, 347 ff, 
for equivalent individuals 186, 206 ff., 
347 ;—reduction with respect to 
energy 80, moment of momentum 
188, 206, with regard to symmetric 
permutation group 283 ff., 320 ff., 
with regard to Pauli exclusion prin- 
ciple 242 ff., 281 ff., 347 ff. 
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Tensor 125 ff., 139, 281, symmetry (| Unitary group, in 2-space 137 ff., its 
class of t. 287, 338, 358, double to} unitary representations @y 137, com- 
347 ;——t. space 125, 281 ff., symmetric} pleteness 137, 163, 389, character- 


transforination in t. space 282, in- 
variant sub-space 296, reduction 301 ; 
— energy-momentum t. 218. 


Term 44, as energy level or character- 
istic number 46, 56, 80, see also under 
Spectrum, Separation ; — t. order, 
normal 206. 


Thomson, G. P. 50. 


.Total quantum number, see under 
Quantum number. 


Trace, of matrix or correspondence 11, 
150, of element of algebra 299, 317. 


Transformation, linear 4 = Correspond- 
ence, linear ;— contragredient 12, unit- 
ary 16, principal 128, symmetric in 
tensor space 282, for arbitrary per- 
mutation group 332, special sym- 
metric 284, canonical 96, in 
quantum mechanics 98 ; —t. to principal 
axes 21 ff., 37;—t. group 111, for 
special groups, see under qualifying 
adjective. 


Transition probability 83, 89, in radia- 
tion field 106 ff. 
Translation, left- 116, cight- 116. 


Translation, electron 195. 


True quantum number, see under 
Quantum number. 

Uhlenbeck 203. 

Uncertainty principle 77, derivation 


393. 


Unimodular linear transformation, group 
128. 


Unit, element of group 114, of field 302, 
of algebra (modulus or principal unit) 
304, basal 168, 304, idempotent 

enerating 168, 21. independent 
599. primitive 293, real 295; — u. 
Hermitian form 15. 


Unitary correspondence, transformation, 
matrix 16 ff., characteristic numbers 
26, infinitesimal 28, u. geometry 
15 ff., u.t. as canonical t. of quantum 
mechanics 98, u. representation of 


group 137 ff. 


istics 151, 163, connection with ro- 
tation group bs 144, augmented 146; 
—ina-space 139 ff., reduction of (u)/ 
and algebra of symmetric transforma- 
tions 285, characteristics 331, 381, 
completeness 381. 


Unitary-orthogonal system of vectors 
or functions’ 19, 33, completeness 33, 
on group manifold 158. 


Valence 342, 369, v. electron 86, 243 


Vector, v. space, v. geometry 1 ff., in 
Hilbert space 31 ff., v. field 20, co- 
variant and contravariant 13, absolute 
magnitude 16, dual 17, scalar pro- 
duct 16, unitary-orthogonal v. or 
system 16, 19, as element of Abelian 
group 134; — 3-v. operator in quantum 
mechanics 197, selection and intensity 
rules 198 ff., complete system of 
orthogonal v. in 3-space 257, Vv. 
potential of electro-magnetic field 98. 


Vector model of atom, Hund’s ror. 
Velocity, phase and group 53. 


Volume, measure of, on manifold of 
closed continuous group 160, for 
unitary group 386, for unitary uni- 
modular group 162, 389. 


Wave equation, de_ Broglie’s §3, 
Schrédinger’s §4 ff., 101, Dirac’s 213, 
218, 225. 


Wave field, Heisenberg-Pauli quantiza- - 
tion of 253 ff. 


Wave length 53. 
Wedderburn’s theorem 313. 
Wentzel 74. 

Wien 41. 

Wigner 280, 320. 
Wintner 309. 


Young, A. 358. 
Young’s symmetry operator 359. 


Zeeman effect, normal 85, 101, 193, 198, 
anomalous 198, 204, 208, 223, for 
doublets 204, for multiplets in gene- 
ral 208 ff. 


